Substrate processing system and substrate processing method, and device manufacturing method

ABSTRACT

A lithography system is provided with: a measurement device measuring position information of marks on a substrate held in a first stage; and an exposure apparatus on a second stage, the substrate for which the position information measurement for the marks has been completed, performs alignment measurement to measure position information for part of marks selected from among the marks on the substrate, and performs exposure. The measurement device measures position information of many marks on the substrate to obtain higher-degree components of correction amounts of an arrangement of divided areas, and the exposure apparatus measures position information of a small number of marks on the substrate to obtain lower-degree components of the correction amounts of the arrangement of the divided areas and exposes the plurality of divided areas while controlling the position of the substrate by using the obtained lower-degree components and the higher-degree components obtained by the measurement device.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of International Application PCT/JP2016/055133, with an international filing date of Feb. 23, 2016, the disclosure of which is hereby incorporated herein by reference in its entirety, which was not published in English.

BACKGROUND OF THE INVENTION Field of the Invention

The present invention relates to substrate processing systems and substrate processing methods, and device manufacturing methods, and more particularly to a substrate processing system and a substrate processing method in which a substrate having a divided area along with at least one mark formed serves as a processing target, and a device manufacturing method using the substrate processing system.

Description of the Background Art

In a lithography process to manufacture semiconductor devices or the like, the devices are formed by overlaying multilayered patterns on a substrate such as a wafer or a glass plate (hereinafter generally referred to as a wafer), however, when the overlay accuracy between each of the layers is poor, the semiconductor devices or the like will not be able to demonstrate predetermined circuit characteristics and in some cases turn out to be defective products. Therefore, normally, marks (alignment marks) are formed in advance in each of the plurality of shot areas on the wafer, and position (coordinate value) of the marks on a stage coordinate system of the exposure apparatus is detected. Thereafter, based on this mark position information and a known position information of a pattern to be formed anew (e.g. a reticle pattern), wafer alignment to align one shot area on the wafer with respect to the pattern is performed.

As a method of wafer alignment, enhanced global alignment (EGA) has become the main stream in which alignment marks of only several shot areas (referred to as sample shot areas or alignment shot areas) on the wafer are detected in balance with throughput and arrangement of shot areas on the wafer is calculated using a statistical technique.

However, in the case of performing overlay exposure on the wafer in the lithography process, the wafer that has gone through the processing process such as resist coating, development, etching, CVD (chemical vapor deposition), and CMP (chemical mechanical polishing) may have distortion in the arrangement of the shot areas in the previous layer induced by the process, which may be a cause that degrades overlay accuracy. In view of such points, recent exposure apparatuses have functions such as a grid correction function that corrects not only a primary component of the wafer, but also a nonlinear component or the like of the shot arrangement that occurs induced by the process (for example, refer to U.S. Patent Application Publication No. 2002/0042664).

However, requirements for overlay accuracy are gradually becoming strict with higher integrated circuits, and it is essential to increase the number of sample shot areas, that is, to increase the number of marks to be detected to perform correction with higher precision. Also, in a twin stage type exposure apparatus having two stages on which wafers are mounted, in parallel to exposure performed on a wafer on one stage, since operations such as detection of marks on the wafer can be executed on the other stage, it becomes possible to increase the number of sample shot areas without reducing the throughput as much as possible.

But now, requirements for overlay accuracy are becoming stricter, and it is becoming difficult even in a twin stage type exposure apparatus to perform detection of a sufficient number of marks to achieve the required overlay accuracy without reducing the throughput.

SUMMARY OF THE INVENTION

According to a first aspect, there is provided a substrate processing system in which a substrate having a plurality of divided areas formed along with at least a mark serves as a processing target, comprising: a measurement device that has a first stage which can hold the substrate, the device measuring position information of a plurality of the marks on the substrate held on the first stage; and an exposure apparatus that has a second stage on which the substrate having completed measurement of position information of the plurality of marks by the measurement device is mounted, the apparatus performing a measurement operation of measuring position information of part of the marks of the plurality of marks on the substrate mounted on the second stage and an exposure operation of exposing the plurality of divided areas with an energy beam, wherein the measurement device obtains a first information concerning arrangement of the plurality of divided areas on the substrate, using the position information of the plurality of marks that has been measured, and the exposure apparatus obtains a second information concerning arrangement of the plurality of divided areas on the substrate using the position information of part of the marks that has been measured, and controls position of the second stage on the exposure operation based on the first information obtained by the measurement device and the second information.

According to a second aspect, there is provided a device manufacturing method, including: exposing a substrate using the substrate processing system according to the first aspect; and developing the substrate that has been exposed.

According to a third aspect, there is provided a substrate processing method in which a substrate having a plurality of divided areas formed along with at least a mark serves as a processing target, comprising: measuring position information of a plurality the marks on the substrate held on a first stage; and performing a measurement operation of making the substrate that has completed measurement of position information of the plurality of marks by the measuring be held by a second stage, and measuring position information of part of the marks of the plurality of marks on the substrate held by the second stage, and an exposure operation of exposing the plurality of divided areas with an energy beam, wherein in the measuring, a first information concerning arrangement of the plurality of divided areas on the substrate is obtained using the position information of the plurality of marks measured, and in performing the measurement operation and the exposure operation, a second information concerning arrangement of the plurality of divided areas on the substrate using the position information of part of the marks measured, and based on the first information obtained by the measuring and the second information, controlling position of the second stage when performing the exposure operation.

BRIEF DESCRIPTION OF DRAWINGS

In the accompanying drawings;

FIG. 1 schematically shows a total structure of a lithography system according to an embodiment;

FIG. 2 is a perspective view schematically showing a structure of a measurement device according to an embodiment;

FIG. 3A is a front view of the measurement device in FIG. 2 (a view from a −Y direction) partially omitted, and FIG. 3B is a partially omitted sectional view of the measurement device sectioned at an XZ plane that passes through an optical axis AX1;

FIG. 4 shows a partially omitted sectional view of the measurement device sectioned at a YZ plane that passes through optical axis AX1 of a mark detection system;

FIG. 5A is a perspective view showing a head section of a first position measurement system, and FIG. 5B is a planar view (a view from a +Z direction) of the head section of the first position measurement system;

FIG. 6 is a view used to describe a structure of a second position measurement system;

FIG. 7 shows a block diagram of an input-output relation of a controller that mainly structures a control system of the measurement device according to the first embodiment;

FIG. 8 is a flowchart corresponding to a processing algorithm of the controller when processing wafers of a lot;

FIG. 9 is a view schematically showing a structure of an exposure apparatus in FIG. 1;

FIG. 10 is a block diagram showing an input-output relation of an exposure controller that the exposure apparatus is equipped with;

FIG. 11 is a is a view schematically showing an entire structure of a lithography system according to a modified example; and

FIG. 12A is a view used when describing an example of a higher-degree partial differentiation correction, and FIG. 12B is a view used when describing an example of a first-degree approximation correction.

DESCRIPTION OF EMBODIMENTS

Hereinafter, an embodiment will be described on the basis of FIGS. 1 to 10. FIG. 1 schematically shows a structure of a lithography system (substrate processing system) 1000 according to a first embodiment.

Lithography system 1000, as is shown in FIG. 1, is equipped with a measurement device 100, an exposure apparatus 200, and a substrate processing device 300 that are connected in-line. Here, as substrate processing device 300, since a coater developer (C/D) is used, hereinafter it will be described also as C/D300, as appropriate. Lithography system 1000 is installed in a clean room. Note that connected in-line means that different devices are connected to one another in a state where a carrier path of the wafer (substrate) is connected, and in the description, terms “connected in-line” and “in-line connection” shall be used in this meaning.

In a general lithography system, as is disclosed in, for example, U.S. Pat. No. 6,698,944 and the like, an in-line interface section having a wafer carrier system inside a chamber for connecting in-line the exposure apparatus and the substrate processing device (C/D) is placed therebetween. Meanwhile, as it can be seen in FIG. 1, in lithography system 1000 related to the embodiment, instead of the in-line interface section, measurement device 100 is placed in between exposure apparatus 200 and C/D 300.

Exposure apparatus 200, C/D 300 and measurement device 100 that lithography system 1000 is equipped with all have a chamber, and the chambers are placed adjacent to one another. An exposure controller 220 that exposure apparatus 200 has, a coater/developer controller 320 that C/D 300 has, and controller 60 that measurement device 100 has are connected to one another via a local area network (LAN) 500, and communication is performed between the three controllers. A storage device 400 is also connected to LAN 500.

First of all, measurement device 100 will be described. FIG. 2 schematically shows a structure of measurement device 100 in a perspective view. Note that although measurement device 100 shown in FIG. 2 is actually structured by a chamber and component parts housed inside the chamber, description related to the chamber will be omitted in the description below. In measurement device 100 according to the embodiment, a mark detection system MDS is provided as it will be described later on, and in the description below, a direction of an optical axis AX1 of mark detection system MDS will be described as a Z-axis direction, a direction in which a movable stage to be described later on moves in long strokes within a surface orthogonal to the Z-axis direction will be described as a Y-axis direction, and a direction orthogonal to the Z-axis and the Y-axis will be described as an X-axis direction, and rotation (inclination) directions around the X-axis, the Y-axis, and the Z-axis will be described as θx, θy, and θz directions, respectively. Mark detection system MDS, here, has an outer shape like a letter L when viewed from the side (e.g. when viewed from a +X direction) with a cylindrical barrel section 41 provided at the lower end (tip), and inside barrel section 41, an optical system (dioptric system) is housed consisting of a plurality of lens elements that have a common optical axis AX1 in the Z-axis direction. In the description, for convenience of explanation, optical axis AX1 of the dioptric system inside barrel section 41 is referred to as optical axis AX1 of mark detection system MDS.

FIG. 3A shows a front view of measurement device 100 in FIG. 2 (a view from a −Y direction) partially omitted, and FIG. 3B shows a sectional view partially omitted of measurement device 100 sectioned at an XZ plane that passes through optical axis AX1. FIG. 4 shows a sectional view partially omitted of measurement device 100 sectioned at a YZ plane that passes through optical axis AX1.

Measurement device 100, as is shown in FIG. 2, is equipped with a surface plate 12 that has an upper surface almost parallel to an XY plane orthogonal to optical axis AX, a wafer slider (hereinafter shortly referred to as a slider) 10 arranged on surface plate 12 movable in predetermined strokes in the X-axis and Y-axis directions and can also finely move (infinitesimal displacement) in the θx, θy and θz directions holding a wafer W with respect to surface plate 12, a drive system 20 that moves slider 10 (not shown in FIG. 2, refer to FIG. 7), a first position measurement system 30 (not shown in FIG. 2, refer to FIGS. 4 and 7) that measures position information of slider 10 with respect to surface plate 12 in each of the X-axis, the Y-axis, the Z-axis, the θx, the θy, and the θz directions (hereinafter described as directions of six degrees of freedom), a measurement unit 40 that has mark detection system MDS to detect a mark on wafer W loaded on (held by) slider 10, a second position measurement system 50 (not shown in FIG. 2, refer to FIG. 7) that measures relative position information between mark detection system MDS (measurement unit 40) and surface plate 12, and a controller 60 (not shown in FIG. 2, refer to FIG. 7) that acquires measurement information with the first position measurement system 30 and measurement information with the second position measurement system 50 while controlling the drive of slider 10 with drive system 20 and obtaining position information of a plurality of marks on wafer W held by slider 10 using mark detection system MDS.

Surface plate 12 consists of a rectangular solid member with a rectangular shape (or a square shape) in a planar view, and its upper surface is finished to have an extremely high flatness so that a guide surface is formed for slider 10 when the slider moves. As the material for surface plate 12, a low thermal expansion coefficient material also called a zero thermal expansion material is used such as, e.g. an invar alloy, an extremely low expansion cast steel, or an extremely low expansion glass ceramics.

Surface plate 12 has a cutout shaped space 12 a whose bottom section is open formed at a total of three places; one at the center in the X-axis direction at a surface on a −Y side, and one each at both ends in the X-axis direction at a surface on a +Y side. Of the three spaces 12 a, FIG. 1 shows space 12 a formed at the surface on the −Y side. Inside each space 12 a, a vibration isolator 14 is arranged. Surface plate 12 is supported at three points by three vibration isolators 14 on an upper surface parallel to an XY plane of a base frame 16 having a rectangular shape in a planar view installed on a floor F, so that the upper surface becomes almost parallel with the XY plane. Note that the number of vibration isolators 14 is not limited to three.

Slider 10, as is shown in FIG. 4, has a total of four air hydrostatic bearings (air bearings) 18, each attached at the four corners to the bottom surface in a state with each bearing surface almost coplanar with the lower surface of slider 10, and by static pressure (pressure in gap) between the bearing surface and the upper surface (guide surface) of surface plate 12 of pressurized air blowing out toward surface plate 12 from the four air bearings 18, slider 10 is supported by levitation via a predetermined clearance (air-gap, gap), e.g. a clearance of several μm, on the upper surface of surface plate 12. In the embodiment, slider 10 uses zero thermal expansion glass (e.g. Zerodur of Schott AG) as its material which is a kind of zero thermal expansion material.

In the upper part of slider 10, a recess section 10 a of a predetermined depth is formed that has a circular shape in a planar view whose inner diameter is slightly larger than the diameter of wafer W, and inside recess section 10 a, a wafer holder WH is arranged whose diameter is almost the same as the diameter of wafer W. As wafer holder WH, while a vacuum chuck, an electrostatic chuck, or a mechanical chuck can be used, a vacuum chuck of a pin chuck method is to be used as an example. Wafer W is held by suction by wafer holder WH in a state where its upper surface is almost flush with the upper surface of slider 10. In wafer holder WH, a plurality of suction ports are formed and the plurality of suction ports are connected to a vacuum pump 11 (refer to FIG. 6) via a vacuum piping system not shown. And operations such as on/off of vacuum pump 11 are controlled by controller 60. Note that one of, or both of slider 10 and wafer holder WH may be referred to as a “first substrate holding member”.

Slider 10 also has a vertical movement member (not shown) which moves vertically, for example, via three circular openings formed in wafer holder WH, and loads the wafer together with a wafer carrier system 70 (not shown in FIG. 2, refer to FIG. 7) onto wafer holder WH as well as unload the wafer from wafer holder WH. A driver 13 that moves the vertical movement member is controlled by controller 60 (refer to FIG. 7).

In the embodiment, as wafer holder WH, a holder with a size that can hold by suction a 300 mm wafer having a diameter of 300 mm is to be used as an example. Note that in the case wafer carrier system 70 has a non-contact holding member that holds by suction the wafer on wafer holder WH from above in a non-contact manner such as a Bernoulli chuck, slider 10 does not require the vertical movement member and therefore the circular opening for wafer holder WH also does not have to be formed.

As is shown in FIGS. 3B and 4, on the lower surface of slider 10 in an area slightly larger than wafer W, a two-dimensional grating (hereinafter simply referred to as grating) RG1 is placed horizontally (parallel to the wafer W surface). Grating RG1 includes a reflection type diffraction grating (X diffraction grating) whose periodic direction is in the X-axis direction and a reflective diffraction grating (Y diffraction grating) whose periodic direction is in the Y-axis direction. The X diffraction grating and the Y diffraction grating have grid lines whose pitch is set, for example, to 1 μm.

Vibration isolator 14 is an active type vibration isolation system (so-called AVIS (Active Vibration Isolation System)) that is equipped with an accelerometer, a displacement sensor (e.g. a capacitive sensor), an actuator (e.g. a voice coil motor), an air mount which functions as an air damper, and the like. Vibration isolator 14 can attenuate vibration of relatively high frequency with the air mount (air damper) and can also isolate vibration (control vibration) with the actuator. Consequently, vibration isolator 14 can avoid vibration from traveling between surface plate 14 and base frame. Note that a hydraulic power damper may be used instead of the air mount (air damper).

Here, the reason why the actuator is provided in addition to the air mount is because since the internal pressure of the gas within the gas chamber of the air mount is high, control response can be secured only to around 20 Hz, therefore, when control of high response is necessary, the actuator has to be controlled according to the output of the accelerometer not shown. However, fine vibration such as floor vibration is isolated by the air mount.

The upper end surface of vibration isolator 14 is connected to surface plate 12. Air (e.g. compressed air) can be supplied to the air mount via a gas supply port not shown, and the air mount expands/contracts in predetermined strokes (e.g. around 1 mm) in the Z-axis direction according to the amount of gas (pressure change of the compressed air) filled inside the air mount. Therefore, by vertically moving individually from below the three places of surface plate 12 using the air mounts that each of the three vibration isolators 14 have, position in the Z-axis direction, the θx direction, and the θy direction of surface plate 12 and slider 10 supported by levitation on the surface plate can be adjusted arbitrarily. The actuator of vibration isolator 14 not only moves surface plate 12 in the Z-axis direction, but also can drive the surface plate in the X-axis direction and the Y-axis direction. Note that drive quantity in the X-axis direction and the Y-axis direction is smaller than the drive quantity in the Z-axis direction. The three vibration isolators 14 are connected to controller 60 (refer to FIG. 6). Note that each of the three vibration isolators 14 may be equipped with an actuator that can move surface plate 12 not only in the X-axis direction, the Y-axis direction, and the Z-axis direction, but also in, e.g. directions of six degrees of freedom. Controller 60 at all times controls the actuators of the three vibration isolators 14 real time so that position in directions of six degrees of freedom of surface plate 12 to which a head section 32 of the first position measurement system 30 to be described later on is fixed maintains a desired positional relation with respect to mark detection system MDS, based on relative position information between mark detection system MDS (measurement unit 40) and surface plate 12 measured by the second position measurement system 50. Note that feedforward control can be performed on each of the three vibration isolators 14. For example, controller 60 may perform feedforward control on each of the three vibration isolators 14 based on measurement information of the first position measurement system 30. Control of vibration isolator 14 by controller 60 is to be described further later on.

Drive system 20, as is shown in FIG. 7, includes a first driver 20A that moves slider 10 in the X-axis direction and a second driver 20B that moves slider 10 in the Y-axis direction integrally with the first driver 20A.

As it can be seen from FIGS. 2 and 4, on a side surface at the −Y side of slider 10, a pair of movers 22 a each consisting of a magnet unit (or a coil unit) having an inverted L-shape in a side view is fixed at a predetermined spacing in the X-axis direction. On the side surface at the +Y side of slider 10, as is shown in FIG. 4, a pair of movers 22 b (mover 22 b at the +X side is not shown) each consisting of a magnet unit (or a coil unit) is fixed at a predetermined spacing in the X-axis direction. Although the pair of movers 22 a and the pair of movers 22 b are placed symmetrical, they have a structure similar to one another.

Movers 22 a and 22 b, as is shown in FIGS. 2 to 4, are placed a predetermined distance apart in the Y-axis direction structuring a part of movable stage 24 which has a rectangular frame shape in a planar view, and is supported in a non-contact manner on an upper surface substantially parallel to an XY plane of a pair of plate members 24 a and 24 b that each extend in the X-axis direction. That is, at a lower surface of movers 22 a and 22 b (a surface that face plate members 24 a and 24 b, respectively) air bearings (not shown) are provided, and by a levitation force (static pressure of pressurized air) generated to plate members 24 a and 24 b generated with these air bearings, movers 22 a and 22 ba are supported in a non-contact manner from below by movable stage 24. Note that self-weight of slider 10 to which each pair of movers 22 a and 22 b are fixed is supported by the levitation force that the four air bearings 18 generate with respect to surface plate 12, as is previously described.

On the upper surface of each of the pair of plate members 24 a and 24 b, as is shown in FIGS. 2 to 4, stators 26 a and 26 b consisting of a magnet unit (or a coil unit) are placed in an area excluding both ends in the X-axis direction.

Electromagnetic interaction between the pair of movers 22 a and stator 26 a generate a movement force (electromagnetic force) for driving the pair of movers 22 a in the X-axis direction and a movement force (electromagnetic force) for driving the pair of movers 22 a in the Y-axis direction, and electromagnetic interaction between the pair of movers 22 b and stator 26 b generate a movement force (electromagnetic force) for driving the pair of movers 22 b in the X-axis direction and a movement force (electromagnetic force) for driving the pair of movers 22 b in the Y-axis direction. That is, the pair of movers 22 a and stator 26 a structure an XY linear motor 28A that generates a movement force in the X-axis direction and the Y-axis direction, the pair of movers 22 b and stator 26 b structure an XY linear motor 28B that generates a movement force in the X-axis direction and the Y-axis direction, and XY linear motor 28A and XY linear motor 28B structure the first driver 20A that moves slider 10 with predetermined strokes in the X-axis direction as well as finely moves the slider in the Y-axis direction (refer to FIG. 7). The first driver 20A can move slider 10 in the θz direction by making the magnitude of each of the movement forces in the X-axis direction generated by XY linear motor 28A and XY linear motor 28B different. The first driver 20A is controlled by controller 60 (refer to FIG. 7). In the embodiment, while the first driver 20A generates not only a movement force in the X-axis direction but also a movement force in the Y-axis direction from the relation of structuring a coarse/fine movement drive system that moves slider 10 in the Y-axis direction with the first driver 20A as well as the second driver to be described later on, the first driver 20A does not necessarily have to generate the movement force in the Y-axis direction.

Movable stage 24 has the pair of plate members 24 a and 24 b and a pair of connecting members 24 c and 24 d placed a predetermined distance apart in the X-axis direction each extending in the Y-axis direction. A step section is formed at both ends in the Y-axis direction of connecting members 24 c and 24 d. Connecting members 24 c and 24 d and plate member 24 a are integrated in a state where one end and the other end in the longitudinal direction of plate member 24 a are mounted on the step sections at the −Y side of each of the connecting members 24 c and 24 d. Also, connecting members 24 c and 24 d and plate member 24 b are integrated in a state where one end and the other end in the longitudinal direction of plate member 24 b are mounted on the step sections at the +Y side of each of the connecting members 24 c and 24 d (refer to FIG. 2B). That is, in this manner, the pair of plate members 24 a and 24 b is connected with the pair of connecting members 24 c and 24 d to structure the frame shaped movable stage 24.

As is shown in FIGS. 2 and 3A, near both ends in the X-axis direction on the upper surface of base frame 16, a pair of linear guides 27 a and 27 b is fixed extending in the Y-axis direction. Inside one of the linear guides 27 a positioned at the +X side, a stator 25 a (refer to FIG. 3B) of a Y-axis linear motor 29A consisting of a coil unit (or a magnet unit) that covers almost the total length in the Y-axis direction is housed on the upper surface and a surface near the −X side. Facing the upper surface and the surface near the −X side of linear guide 27 a, a mover 23 a is placed consisting of a magnet unit (or coil unit) having an L-shaped cross sectional surface that structures Y-axis linear motor 29A along with stator 25 a. To the lower surface and the surface at the +X side of mover 23 a that face the upper surface and the surface at the −X side of linear guide 27 a, respectively, air bearings are fixed that blow out pressurized air to the opposing surface. Of the air bearings, especially as the air bearings fixed to the surface at the +X side of mover 23 a, vacuum preloaded air bearings are used. The vacuum preloaded air bearings maintain a clearance (space, gap) in the X-axis direction between mover 23 a and linear guide 27 a at a constant value by balancing the static pressure of the pressurized air and the vacuum preload force between the bearing surface and linear guide 27 a.

On the upper surface of mover 23 a, a plurality of X guides 19 consisting of, for example, two rectangular solid members, are fixed spaced apart at a predetermined distance in the Y-axis direction. Each of the two X guides 19 is engaged in a non-contact manner with a slide member 21 having an inversed U sectional shape that structures a uniaxial guide device along with X guide 19. Air bearings are provided at each of the three surfaces of slide member 21 that face X guide 19.

The two slide members 21, as is shown in FIG. 2, are each fixed to the lower surface (surface at the −Z side) of connecting member 24 c.

The other linear guide 27 b positioned at the −X side houses inside a stator 25 b of a Y-axis linear motor 29B consisting of a coil unit (or a magnet unit), and is structured similar to linear guide 27 a except for being symmetric (refer to FIG. 3B). Facing the upper surface and the surface near the +X side of linear guide 27 b, a mover 23 b is placed consisting of a magnet unit (or coil unit) which is symmetric but has an L-shaped cross sectional surface similar to mover 23 a that structures Y-axis linear motor 29B along with stator 25 b. Facing each of the upper surface and the surface at the +X side of linear guide 27 b, air bearings are fixed to each of the lower surface and the surface at the −X side of mover 23 b, and especially as the air bearings fixed to the surface at the −X side of mover 23 b, vacuum preloaded air bearings are used. By the vacuum preloaded air bearings, the clearance (void, gap) in the X-axis direction between mover 23 b and linear guide 27 b is maintained at a constant value.

Between the upper surface of mover 23 b and the bottom surface of connecting member 24 d, as is previously described, two uniaxial guide devices structured by X guide 19 and slide member 21 that engages with X guide 19 in a non-contact manner are provided.

Movable stage 24 is supported from below by movers 23 a and 23 b via two each of (a total of four) uniaxial guide devices on the +X side and the −X side, and is movable in the X-axis direction on mover 23 a and 23 b. Therefore, by the first driver 20A previously described, when slider 10 is moved in the X-axis direction, reaction force of the movement force acts on movable stage 24 in which stators 26 a and 26 b are provided and movable stage 24 moves in a direction opposite to slider 10 according to the momentum conservation law. That is, the movement of movable stage 24 prevents (or effectively suppresses) generation of vibration caused by the reaction force of the movement force in the X-axis direction to slider 10. That is, movable stage 24 functions as a counter mass when slider 10 moves in the X-axis direction. However, movable stage 24 does not necessarily have to function as a counter mass. Note that a counter mass may be provided to prevent (or effectively suppress) generation of vibration caused by the movement force to move slider 10 in the Y-axis direction with respect to movable stage 24, although it is not provided here in particular since slider 10 only moves finely in the Y-axis direction with respect to movable stage 24.

Y-axis linear motor 29A generates a movement force (electromagnetic force) that moves mover 23 a in the Y-axis direction by electromagnetic interaction between mover 23 a and stator 25 a, and Y-axis linear motor 29B generates a movement force (electromagnetic force) that moves mover 23 b in the Y-axis direction by electromagnetic interaction between mover 23 b and stator 25 b.

The movement force in the Y-axis direction that Y-axis linear motors 29A and 29B generate acts on movable stage 24 via two each of the uniaxial guide devices at the +X side and the −X side. This allows slider 10 to be moved in the Y-axis direction integrally with movable stage 24. That is, in the embodiment, movable stage 24, the four uniaxial guide devices, and the pair of Y-axis linear motors 29A and 29B structure the second driver 20B (refer to FIG. 7) that moves slider 10 in the Y-axis direction.

In the embodiment, the pair of Y-axis linear motors 29A and 29B is physically separated from surface plate 12 and is also vibrationally separated by the three vibration isolators 14. Note that linear guides 27 a and 27 b in which stators 25 a and 25 b of the pair of Y-axis linear motors 29A and 29B provided may be structured movable in the Y-axis direction with respect to base frame 16, so that the linear guides may function as a counter mass when driving slider 10 in the Y-axis direction.

Measurement unit 40, as is shown in FIG. 2, has a unit main section 42 that has a cutout shaped space 42 a having an opening at a bottom section formed at a surface on the −Y side, mark detection system MDS previously described connected to unit main section 42 in a state where a base end is inserted into space 42 a, and a connection mechanism 43 that connects barrel section 41 at the tip of mark detection system MDS to unit main section 42.

Connection mechanism 43 includes a support plate 44 that supports barrel section 41 from the back side (the +Y side) via a mounting member (not shown), and a pair of support arms 45 a and 45 b whose one end respectively supports support plate 44 and the other end is respectively fixed to the bottom surface of unit main section 42.

In the embodiment, corresponding to the point that a sensitive agent (resist) is coated on the upper surface of the wafer held on slider 10, a system using a detection beam having a wavelength that is not sensitive to the resist is used as mark detection system MDS. As mark detection system MDS, for example, an FIA (Field Image Alignment) system of an image processing method is used that irradiates a broadband detection beam which does not expose the resist coated on the wafer on a target mark, images an image of the target mark formed on a light receiving surface by the reflection light from the target mark and an image of an index (not shown) (an index pattern on an index plate provided inside) using an imaging device (such as a CCD), and outputs their imaging signals. The imaging signals from mark detection system MDS are supplied (refer to FIG. 7) to controller 60 via a signal processor 49 (not shown in FIG. 2, refer to FIG. 7). Mark detection system MDS has an alignment auto-focus function that adjusts the focal position of the optical systems.

Between barrel section 41 and support plate 44, as is shown in FIG. 2, a head mounting member 51 with a rough isosceles triangle shape is placed. In head mounting member 51, an opening section penetrating in the Y-axis direction of FIG. 2 is formed and barrel section 41 is attached to (fixed to) support plate 44 via the mounting member (not shown) inserted in the opening section. Head mounting member 51 also has its rear surface fixed to support plate 44. In this manner, barrel section 41 (mark detection system MDS), head mounting member 51, and support plate 44 are integrated with unit main section 42, via the pair of support arms 45 a and 45 b.

Inside unit main section 42, signal processor 49 and the like previously described are placed that performs processing on the imaging signals output as detection signals from mark detection system MDS, calculates position information of the target mark with respect to the detection center, and outputs the information to controller 60. Unit main section 42 is supported at three points from below via, e.g. three vibration isolators 48 on a support frame 46 having a portal shape when viewed from the −Y side installed on base frame 16. Each vibration isolator 48 is an active type vibration isolation system (a so-called AVIS (Active Vibration Isolation System) and is equipped with an accelerometer, a displacement sensor (e.g. a capacitive sensor or the like), an actuator (e.g. a voice coil motor or the like), a mechanical damper such as an air damper or a hydraulic damper and the like, and vibration isolator 48 can attenuate relatively high frequency vibration with the mechanical damper as well as isolate vibration (control vibration) with the actuator. Consequently, each vibration isolator 48 can avoid relatively high frequency vibration from traveling between support frame 46 and unit main section 42.

Note that mark detection system MDS is not limited to the FIA system, and for example, a diffracted light interference type alignment detection system may also be used that irradiates a coherent detection light on the subject mark, makes two diffracted lights (e.g. diffracted lights of the same order or diffracted lights diffracted in the same direction) generated from the target mark interfere with each other, and detects the interfered light and outputs the detection signals, instead of the FIA system. Or, the diffracted light interference type alignment system may be used with the FIA system and the two target marks may be detected simultaneously. Furthermore, as mark detection system MDS, a beam scan type alignment system that scans a measurement beam in a predetermined direction with respect to a target mark while slider 10 is moved in a predetermined direction may also be used. Also, in the embodiment, while mark detection system MDS has the alignment auto-focus function, instead of or in addition to this, measurement unit 40 may be equipped with a focal position detection system such as a multi-point focal position detection system of an oblique incidence method having a structure similar to the one disclosed in, for example, U.S. Pat. No. 5,448,332.

The first position measurement system 30, as is shown in FIGS. 3B and 4, is placed within a recess section formed on the upper surface of surface plate 12 and has head section 32 fixed to surface plate 12. The upper surface of head section 32 faces the lower surface of slider 10 (forming surface of grating RG1). A predetermined clearance (void, gap), e.g. a clearance of several mm, is formed between the upper surface of head section 32 and the lower surface of slider 10.

The first position measurement system 30, as is shown in FIG. 7, is equipped with an encoder system 33 and a laser interferometer system 35. Encoder system 33 can acquire position information of slider 10 by irradiating a plurality of beams from head section 32 on a measurement section (forming surface of grating RG1) on the lower surface of slider 10 as well as receiving a plurality of return beams (e.g. a plurality of diffracted beams from grating RG1) from the measurement section on the lower surface of slider 10. Encoder system 33 includes an X linear encoder 33 x which measures position in the X-axis direction of slider 10 and a pair of Y linear encoders 33 ya and 33 yb which measure position in the Y-axis direction of slider 10. In encoder system 33, a head of a diffraction interference type having a structure similar to the encoder head disclosed in, for example, U.S. Pat. No. 7,238,931, U.S. Patent Application Publication No. 2007/0288121 and the like (hereinafter shortly referred to as an encoder head as appropriate) is used. Note that while a head includes a light source, a light receiving system (including a photodetector), and an optical system, in the embodiment, of these parts, only at least the optical system has to be placed inside the housing of head section 32 facing grating RG1, and at least one of the light source and the light receiving system may be placed outside of the housing of head section 32.

FIG. 5A shows head section 32 in a perspective view, and FIG. 5B shows the upper surface of head section 32 in a planar view when viewed from a +Z direction. Encoder system 33 measures the position in the X-axis direction of slider 10 with one X head 37 x, and measures the position in the Y-axis direction with a pair of Y heads 37 ya and 37 yb (refer to FIG. 5B). That is, X linear encoder 33 x previously described is structured by X head 37 x which measures the position in the X-axis direction of slider 10 using an X diffraction grating of grating RG1, and the pair of Y linear encoders 33 ya and 33 yb is structured by the pair of Y heads 37 ya and 37 yb which measure the position in the Y-axis direction of slider 10 using a Y diffraction grating of grating RG1.

As is shown in FIGS. 5A and 5B, on a straight line LX which passes through the center of head section 32 and is parallel to the X-axis, X head 37 x irradiates measurement beams LBx₁ and LBx₂ (indicated by a solid line in FIG. 4A) on the same irradiation point on grating RG1 from two points (refer to white circles in FIG. 5B) equidistant from a straight line CL which passes through the center of head section 32 and is parallel to the Y-axis. Position in the X-axis direction and the Y-axis direction of the irradiation points of measurement beams LBx₁ and LBx₂, that is, detection points of X head 37 x (refer to reference code DP in FIG. 5B), coincides with the detection center of mark detection system MDS.

Here, measurement beams LBx₁ and LBx₂ are beams on which polarized beam splitting is performed by a polarized beam splitter (not shown) on a beam from a light source, and when measurement beams LBx₁ and LBx₂ are irradiated on grating RG1, diffracted beams of a predetermined order of these measurement beams LBx₁ and LBx₂ diffracted by the X diffraction grating, e.g. a first order diffraction beam (a first diffraction beam), are each returned by a reflection mirror via a lens and a quarter wavelength plate (not shown), and by the beams passing through the quarter wavelength plate twice the polarization direction is rotated by 90 degrees which allows the beams to pass through the original optical path and re-enter the polarized beam splitter where the beams are coaxially synthesized, and then by the photodetector (not shown) receiving the interference light of the first order diffraction beams of measurement beams LBx₁ and LBx₂, position in the X-axis direction of slider 10 is measured.

As is shown in FIG. 5B, the pair of Y heads 37 ya and 37 yb are placed on the +X side and the −X side of straight line CL, respectively. Y head 37 ya, as is shown in FIGS. 5A and 5B, irradiates measurement beams LBya₁ and LBya₂ each indicated by broken lines in FIG. 5A on a common irradiation point on grating RG1 from two points (refer to white circles in FIG. 5B) equidistant from straight line LX on a straight line LYa. The irradiation point of measurement beams LBya₁ and LBya₂, that is, detection point of Y head 37 ya is indicated by reference code DPya in FIG. 5B.

Y head 37 yb irradiates measurement beams LByb₁ and LByb₂ on a common irradiation point DPyb on grating RG1 from two points (refer to white circles in FIG. 5B) symmetric to outgoing points of measurement beams LBya₁ and LBya₂ of Y head 37 ya with respect to straight line CL. As is shown in FIG. 5B, detection points DPya and DPyb of each of the Y heads 37 ya and 37 yb are placed on straight line LX parallel to the X-axis.

Measurement beams LBya₁ and LBya₂ are also beams of the same beam split by polarization by the polarized beam splitter, and by interference light of a predetermined order of these measurement beams LBya₁ and LBya₂ diffracted by the Y diffraction grating, e.g. a first order diffraction beam (a second diffraction beam) being photodetected by the photodetector (not shown) similar to the description above, position in the Y-axis direction of slider 10 is measured. For measurement beams LByb₁ and LByb₂ as well, position in the Y-axis direction of slider 10 is measured by interference light of a first order diffraction beam (a second diffraction beam) being photodetected by the photodetector (not shown), similar to measurement beams LBya₁ and LBya₂.

Here, controller 60 decides the position in the Y-axis direction of slider 10 based on an average of the measurement values of the two Y heads 37 ya and 37 yb. Consequently, in the embodiment, the position in the Y-axis direction of slider 10 is measured with a midpoint DP of detection points DPya and DPyb serving as a substantial measurement point. Midpoint DP coincides with the irradiation point on grating RG1 of measurement beams LBx₁ and LBX₂.

That is, in the embodiment, for measuring position information in the X-axis direction and Y-axis direction of slider 10, the device has a common detection point, and controller 60 controls this detection point so that the position within the XY plane coincides with the detection center of mark detection system MDS at, for example, a nm level, by controlling at all times the three vibration isolators 14 real time, based on relative position information between mark detection system MDS (measurement unit 40) and surface plate 12 measured by the second position measurement system 50. Consequently, in the embodiment, by using encoder system 33, controller 60 can always perform measurement of position information within the XY plane of slider 10 directly under (rear surface side of slider 10) the detection center of mark detection system MDS when measuring the alignment marks on wafer W mounted on slider 10. Controller 60 also measures the rotation quantity in the θz direction of slider 10, based on a difference between measurement values of the pair of Y heads 37 ya and 37 yb.

Laser interferometer 35 can acquire position information of slider 10, by making a measurement beam enter the measurement section (the surface on which grating RG1 is formed) on the lower surface of slider 10 along with receiving the return beam (e.g. reflection light from a surface on which grating RG1 is formed). Laser interferometer 35, as is shown in FIG. 5A, makes four measurement beams LBz₁, LBz₂, LBz₃, and LBz₄ enter the lower surface of slider 10 (the surface on which grating RG1 is formed). Laser interferometer system 35 is equipped with laser interferometers 35 a to 35 d (refer to FIG. 7) that irradiate the four measurement beams LBz₁, LBz₂, LBz₃, and LBz₄, respectively. In the embodiment, laser interferometers 35 a to 35 d structure four Z heads.

In laser interferometer system 35, as is shown in FIGS. 5A and 5B, four measurement beams LBz₁, LBz₂, LBz₃, and LBz₄ are emitted parallel to the Z-axis from four points corresponding to the four vertices of a square whose center is detection point DP and has two sides parallel to the X-axis and two sides parallel to the Y-axis. In this case, the outgoing points (irradiation points) of measurement beams LBz₁ and LBz₄ are at equal distances from straight line LX on straight line LYa, and the outgoing points (irradiation points) of the remaining measurement beams LBz₂ and LBz₃ are at equal distances from straight line LX on a straight line LYb. In the embodiment, the surface on which grating RG1 is formed also functions as a reflection surface of each measurement beam from laser interferometer system 35. Controller 60 measures information on the position in the Z-axis direction and the rotation quantity in the θx direction and the θy direction of slider 10, using laser interferometer system 35. Note that as it is obvious from the description above, although slider 10 is not positively moved by drive system 20 previously described with respect to surface plate 12 in the Z-axis, the θx and the θy directions, because slider 10 is supported by levitation on surface plate 12 by the four air bearings placed at the four corners of the bottom surface, the position of slider 10 actually changes on surface plate 12 in each of the Z-axis, the θx and the θy directions. That is, slider 10 is actually movable with respect to surface plate 12 in each of the Z-axis, the θx and the θy directions. Displacement in each of the θx and the θy directions in particular causes a measurement error (Abbe error) in encoder system 33. Taking such points into consideration, position information in each of the Z-axis, the θx and the θy directions of slider 10 is measured by the first position measurement system 30 (laser interferometer system 35).

Note that for measurement of information on position in the Z-axis direction and the rotation quantity in the θx direction and the θy direction of slider 10, since the beams only have to be incident on three different points on the surface where grating RG1 is formed, the Z heads, e.g. laser interferometers, that are necessary should be three. Note that a cover glass to protect grating RG1 can be provided on the lower surface of slider 10, and on the surface of the cover glass, a wavelength selection filter may be provided that allows each measurement beam from encoder system 33 to pass and prevents each measurement beam from laser interferometer system 35 from passing.

As it can be seen from the description so far, controller 60 can measure the position in directions of six degrees of freedom of slider 10 by using encoder system 33 and laser interferometer system 35 of the first position measurement system 30. In this case, in encoder system 33, influence of air fluctuation can almost be ignored since the optical path lengths of the measurement beams in the air are extremely short and are almost equal. Consequently, position information within the XY plane (including the θz direction) of slider 10 can be measured with high precision by encoder system 33. Also, because the substantial detection point on grating RG1 in the X-axis direction and the Y-axis direction by encoder system 33 and the detection point on the lower surface of slider 10 in the Z-axis direction by laser interferometer system 35 each coincide with the detection center of mark detection system MDS within the XY plane, generation of the so-called Abbe error which is caused by shift within the XY plane between the detection point and the detection center of mark detection system MDS can be suppressed to a level that can be ignored. Consequently, controller 60 can measure the position in the X-axis direction, the Y-axis direction, and the Z-axis direction of slider 10 without the Abbe error caused by shift in the XY plane between the detection point and the detection center of mark detection system MDS with high precision by using the first position measurement system 30.

However, for the Z-axis direction parallel to optical axis AX1 of mark detection system MDS, position information in the XY plane of slider 10 is not necessarily measured at a position at the surface of wafer W by encoder system 33, that is, the Z position of the placement surface of grating RG1 and the surface of wafer W do not necessarily coincide. Therefore, in the case grating RG1 (that is, slider 10) is inclined with respect to the XY plane, when slider 10 is positioned based on measurement values of each of the encoders of encoder system. 33, as a result, a positioning error (a kind of Abbe error) corresponding to the inclination with respect to the XY plane of grating RG1 occurs due to a Z position difference ΔZ (that is, positional displacement in the Z-axis direction between the detection point by encoder system 33 and the detection center (detection point) by mark detection system MDS) between the placement surface of grating RG1 and the surface of wafer W. However, this positioning error (position control error) can be acquired by a simple calculation by using difference ΔZ, pitching quantity θx, and rolling quantity θy, and using this as an offset and by setting the position of slider 10 based on position information after correction in which measurement values of (each encoder of) encoder system 33 are corrected by the offset amount, the kind of Abbe error described above no longer affects the measurement. Or, instead of correcting the measurement values of (each encoder of) encoder system 33, one or a plurality of information for moving the slider such as a target position to where slider 10 should be positioned may be corrected, based on the above offset.

Note that in the case grating RG1 (that is, slider 10) is inclined with respect to the XY plane, head section 32 may be moved so that a positioning error due to the inclination does not occur. That is, in the case an inclination has been measured in grating RG1 (that is, slider 10) with respect to the XY plane by the first position measurement system 30 (e.g. interferometer system 35), surface plate 12 that holds head section 32 may be moved, based on position information acquired using the first position measurement system 30. Surface plate 12, as is described above, can be moved using vibration isolators 14.

Also, in the case grating RG1 (that is, slider 10) is inclined with respect to the XY plane, position information of the mark acquired using mark detection system MDS may be corrected, based on the positioning error caused by the inclination.

The second position measurement system 50, as is shown in FIGS. 2, 3A and 3B, has a pair of head sections 52A and 52B provided at the lower surface of one end and the other end in the longitudinal direction of head mounting member 51 previously described, and scale members 54A and 54B that are placed facing head sections 52A and 52B. Scale members 54A and 54B have an upper surface which is the same height as the surface of wafer W held by wafer holder WH. On each of the upper surfaces of scale members 54A and 54B, reflection type two-dimensional gratings RG2 a and RG2 b are formed. Two-dimensional gratings (hereinafter shortly referred to as gratings) RG2 a and RG2 b both include a reflective diffraction grating (X diffraction grating) whose periodic direction is in the X-axis direction and a reflective diffraction grating (Y diffraction grating) whose periodic direction is in the Y-axis direction. Pitch of grid lines of the X diffraction grating and the Y diffraction grating is set, for example, to 1 μm.

Scale members 54A and 54B consist of a material having a low thermal expansion, e.g. a zero thermal expansion material, and are each fixed on surface plate 12 via support members 56, as is shown in FIGS. 3A and 3B. In the embodiment, dimensions of scale members 54A and 54B and support members 56 are decided so that gratings RG2 a and RG2 b face head sections 52A and 52B with a gap of around several mm in between.

As is shown in FIG. 6, one head section 52A fixed to the lower surface at the end on the +X side of head mounting member 51 includes an XZ head 58X₁ whose measurement direction is in the X-axis and the Z-axis directions and a YZ head 58Y₁ whose measurement direction is in the Y-axis and the Z-axis directions that are housed in the same housing. XZ head 58X₁ (to be more accurate, an irradiation point on grating RG2 a of the measurement beam emitted by XZ head 58X₁) and YZ head 58Y₁ (to be more accurate, an irradiation point on grating RG2 a of the measurement beam emitted by YZ head 58Y₁) are placed on the same straight line parallel to the Y-axis.

The other head section 52B is placed symmetric to head section 52A with respect to a straight line (hereinafter called a reference axis) LV which passes through optical axis AX1 of mark detection system MDS and is parallel to the Y-axis, however, the structure is similar to that of head section 52A. That is, head section 52B has XZ head 58X₂ and YZ head 58Y₂ placed symmetric to XZ head 58X₁ and YZ head 58Y₁ with respect to reference axis LV, and the irradiation points of the measurement beams irradiated on grating RG2 b from each of the XZ head 58X₂ and YZ head 58Y₂ set on the same straight line parallel to the Y-axis.

As each of the XZ heads 58X₁ and 58X₂ and the YZ heads 58Y₁ and 58Y₂, an encoder head having a structure similar to the displacement measurement sensor head disclosed in, for example, U.S. Pat. No. 7,561,280, can be used.

Head sections 52A and 52B structure an XZ linear encoder which measures position in the X-axis direction (X position) and position in the Z-axis direction (Z position) of gratings RG2 a and RG2 b and a YZ linear encoder which measures position in the Y-axis direction (Y position) and Z position, using scale members 54A and 54B, respectively. Gratings RG2 a and RG2 b, here, are formed on the upper surface of scale members 54A and 54B which are each fixed on surface plate 12 via support members 56, and head sections 52A and 52B are provided at head mounting member 51 which is integral with mark detection system MDS. As a result, head sections 52A and 52B measure the position (positional relation between mark detection system MDS and surface plate 12) of surface plate 12 with respect to mark detection system MDS. In the description below, for the sake of convenience, XZ linear encoder and YZ linear encoder will be described as XZ linear encoders 58X₁ and 58X₂ and YZ linear encoders 58Y₁ and 58Y₂ (refer to FIG. 6), using the same reference code as XZ heads 58X₁ and 58X₂ and YZ heads 58Y₁ and 58Y₂.

In the embodiment, XZ linear encoder 58X₁ and YZ linear encoder 58Y₁ structure a four-axis encoder 58 ₁ (refer to FIG. 7) that measures position information in each of the X-axis, the Y-axis, the Z-axis, and the θx directions with respect to mark detection system MDS of surface plate 12. Similarly, XZ linear encoder 58X₂ and YZ linear encoder 58Y₂ structure a four-axis encoder 58 ₂ (refer to FIG. 7) that measures position information in each of the X-axis, the Y-axis, the Z-axis, and the θx directions with respect to mark detection system MDS of surface plate 12. In this case, position information in the θy direction with respect to mark detection system MDS of surface plate 12 is obtained (measured), based on position information in the Z-axis direction with respect to mark detection system MDS of surface plate 12 measured by each of the four-axis encoders 58 ₁ and 58 ₂, and position information in the θz direction with respect to mark detection system MDS of surface plate 12 is obtained (measured), based on position information in the Y-axis direction with respect to mark detection system MDS of surface plate 12 measured by each of the four-axis encoders 58 ₁ and 58 ₂.

Accordingly, four-axis encoder 58 ₁ and four-axis encoder 58 ₂ structure the second position measurement system 50 which measures position information in directions of six degrees of freedom with respect to mark detection system MDS of surface plate 12, namely, measures information on relative position in directions of six degrees of freedom between mark detection system MDS and surface plate 12. The information on relative position in directions of six degrees of freedom between mark detection system MDS and surface plate 12 measured by the second position measurement system 50 is supplied at all times to controller 60, and based on this information on relative position, controller 60 controls the actuators of the three vibration isolators 14 real time so that the detection point of the first position measurement system 30 is in a desired positional relation with respect to the detection center of mark detection system MDS, or to be more specific, the position in the XY plane of the detection point of the first position measurement system 30 coincides with the detection center of mark detection system MDS such as at a nm level, and the surface of wafer W on slider 10 also coincides with the detection position of mark detection system MDS. In this case, for example, straight line CL previously described coincides with reference axis LV. Note that if the detection point of the first position measurement system 30 can be controlled to be in a desired positional relation with respect to the detection center of mark detection system MDS, the second position measurement system 50 does not have to measure the information on relative position in all directions of six degrees of freedom.

As is obvious from the description of the first position measurement system 30 and the description of the second position measurement system 50 previously described, in measurement device 100, the first position measurement system 30 and the second position measurement system 50 structure a position measurement system that measures position information in directions of six degrees of freedom of slider 10 with respect to mark detection system MDS.

FIG. 7 shows a block diagram of an input output relation of controller 60 which mainly structures a control system of measurement device 100 according to the embodiment. Controller 60 includes a workstation (or a microcomputer) or the like, and has overall control over each part of measurement device 100. As is shown in FIG. 7, measurement device 100 is equipped with wafer carrier system 70 placed inside a chamber along with component parts shown in FIG. 2. Wafer carrier system 70 consists of, for example, a horizontal multi-joint arm robot.

Next, a series of operations when processing a single lot of wafers in measurement device 100 according to the embodiment having the structure described above is described based on a flowchart in FIG. 8 that corresponds to a processing algorithm of controller 60.

As a premise, wafer W serving as a measurement target of measurement device 100 is to be a 300 mm wafer, and on wafer W, by exposure performed earlier on the previous layers, a plurality of, e.g. I (as an example, I=98) divided areas called shot areas (hereinafter called shots) are formed placed in a matrix state, and on street lines surrounding each shot or street lines inside each shot (in the case a plurality of chips are made in one shot), marks of a plurality of types of marks, such as search alignment marks for search alignment, wafer alignment marks (wafer marks) for fine alignment and the like are to be provided. The marks of the plurality of types of marks are formed along with the divided areas. In the embodiment, as the search marks and the wafer marks, two-dimensional marks are to be used.

Also, with measurement device 100, a plurality of measurement modes in which mark detection conditions by mark detection system MDS are different shall be settable. As the plurality of measurement modes, as an example, the following modes shall be settable; A-mode in which one wafer mark is detected for all shots in all wafers, and B-mode in which marks of a plurality of wafer marks are detected for all shots in a predetermined number of wafers at the beginning of a lot, and according to the detection results of the wafer marks, wafer marks subject to detection for each shot are decided for the remaining wafers in the lot, and the wafer marks that have been decided are detected.

Also, information necessary for alignment measurement to wafer W is input via an input device (not shown) in advance by an operator of measurement device 100, and the information is to be stored in a memory in controller 60. The information necessary for alignment measurement, here, includes information of various types such as; information on thickness of wafer W, information on flatness of wafer holder WH, and design information on shot areas and arrangement of alignment marks on wafer W. Note that setting information of the measurement mode is to be input in advance via the input device (not shown), for example, by the operator.

Processing algorithm corresponding to the flowchart in FIG. 8 is to start, for example, when coater/developer controller 320 of C/D 300 connected in-line to measurement device 100 requests permission for starting carriage of the wafers of one lot, controller 60 responds to the request, and then the first wafer is delivered to a predetermined delivery position (a first substrate delivery section to be described later on).

First of all, in step S102, a count value i of a counter showing the wafer number within the lot is initialized to 1 (i←1).

In the next step S104, wafer W is loaded onto slider 10. This loading of wafer W is performed by wafer carrier system 70 and the vertical movement member on slider 10 under the control of controller 60. Specifically, wafer W is carried from the wafer carrier (or delivery position) to a position above slider 10 located at the loading position by wafer carrier system 70, and by driver 13 driving the vertical movement member upward by a predetermined amount, wafer W is delivered to the vertical movement member. Then, after wafer carrier system 70 withdraws from the position above slider 10, the vertical movement member is driven downward by driver 13 so that wafer W is mounted on wafer holder WH on slider 10. Then, vacuum pump 11 is turned on, and wafer W loaded on slider 10 is vacuum chucked by wafer holder WH. Note that when measurement device 100 is connected in-line to the substrate processing device, the wafers are carried in sequentially from a wafer carrier system of the substrate processing device side, and are mounted to the delivery position.

In the next step, S106, position in the Z-axis direction (Z position) of wafer W is adjusted. Prior to this adjustment of Z position, controller 60 controls the internal pressure (drive force in the Z-axis direction that vibration isolators 14 generate) of the air mounts of the three vibration isolators 14 based on relative position information in the Z-axis direction, the θy direction, and the θx direction between mark detection system MDS and surface plate 12 measured by the second position measurement system 50, and surface plate 12 is set so that its upper surface becomes parallel to the XY plane and the Z position becomes a predetermined reference position. Wafer W is considered to have uniform thickness. Accordingly, in step S106, controller 60, based on thickness information of wafer W stored in memory, adjusts the drive force in the Z-axis direction that the three vibration isolators 14 generate, such as for example, the internal pressure (quantity of compressed air) of the air mount, so that surface plate 12 is driven in the Z-axis direction and the Z position of the wafer W surface is adjusted, so that the wafer W surface is set to a range in which the focal position of the optical system can be adjusted by the auto-focus function of mark detection system MDS. Note that in the case measurement unit 40 is equipped with a focal position detection system, controller 60 may perform Z position adjustment of the wafer surface based on detection results (output) of the focal position detection system. For example, mark detection system MDS may be equipped with a focal position detection system that detects position in the Z-axis direction of the wafer W surface via an optical element (objective optical element) at the tip portion. Also, adjustment on the Z position of the wafer W surface based on the detection results of the focal point position detection system can be performed, by moving slider 12 using vibration isolators 14 and moving slider 10 along with surface plate 12. Note that drive system 20 having a structure that can move slider 10 not only in directions within the XY plane but also in the Z-axis direction, the θx direction, and the θy direction may be employed, and slider 10 may be moved using drive system 20. Note that Z position adjustment of the wafer surface may include adjusting inclination of the wafer surface. When there is a possibility of an error (a kind of Abbe error) occurring due to Z position difference ΔZ between the arrangement surface of grating RG1 and the surface of wafer W by using drive system 20 to adjust the inclination of the wafer surface, at least one of the countermeasures like the ones described above should be executed.

In the next step, S108, search alignment of wafer W is performed. Specifically, for example, at least two search marks positioned in the periphery section almost symmetric with respect to the wafer W center are detected using mark detection system MDS. Controller 60 controls the drive of slider 10 by drive system 20, and while positioning each search mark within a detection area (detection field) of mark detection system MDS, acquires measurement information according to the first position measurement system 30 and measurement information according to the second position measurement system 50, and then obtains position information of each search mark based on detection signals when detecting the search mark formed on wafer W using mark detection system and measurement information according to the first position measurement system 30 (and measurement information according to the second position measurement system 50).

To be more specific, controller 60 obtains position coordinates on a reference coordinate system of the two search marks, based on detection results (relative positional relation between the detection center (index center) of mark detection system MDS obtained from the detection signals and each search mark) of mark detection system MDS output from signal processor 49, measurement values of the first position measurement system 30 (and measurement values of the second position measurement system 50) at the time of detection of each search mark. The reference coordinate system, here, is an orthogonal coordinate system set by the measurement axes of the first position measurement system 30.

After this, residual rotation error of wafer W is calculated from the position coordinates of the two search marks, and slider 10 is rotated finely so that the rotation error becomes almost zero. This completes search alignment of wafer W. Note that because wafer W is actually loaded onto slider 10 in a state where pre-alignment has been performed, center position displacement of wafer W is small enough to be ignored, and the residual rotation error is extremely small.

In the next step, S110, judgment is made of whether the measurement mode set is A-mode or not. And when the judgment in step S110 is positive, that is, in the case the measurement mode set is A-mode, then the operation moves to step S112.

In step S112, alignment measurement with respect to all wafers (full-shot one point measurement, in other words, full-shot EGA measurement), that is, one wafer mark is measured for each of the 98 shots. Specifically, controller 60 obtains the position coordinates on the reference coordinate system of the wafer mark on wafer W, that is, obtains the position coordinates of the shot, similar to the measurement of position coordinates of each search mark at the time of search alignment previously described. However, in this case, on calculating the position coordinates of the shot, measurement information of the second position measurement system 50 must be used, which is different from the time of search alignment. The reason is, as is previously described, controller 60, based on measurement information of the second position measurement system 50, controls the actuators of the three vibration isolators 14 real time so that the position in the XY plane of the detection point of the first position measurement system 30 coincides with the detection center of mark detection system MDS such as at a nm level, and the surface of wafer W on slider 10 also coincides with the detection position of mark detection system MDS. However, at the time of detection of the wafer mark, since there is no guarantee that the position in the XY plane of the detection point of the first position measurement system 30 coincides with the detection center of mark detection system MDS such as at a nm level, the position coordinates of the shot has to be calculated, taking into consideration the positional displacement of both the detection point and the detection center as offsets. For example, by correcting the detection results of mark detection system MDS or the measurement values of the first position measurement system 30 using the above offsets, the position coordinates on the reference coordinate system of the wafer mark on wafer W that are calculated can be corrected.

Here, on this full-shot one point measurement, controller 60 positions the wafer mark within the detection area of mark detection system MDS by moving slider 10 (wafer W) in at least one of the X-axis direction and the Y via drive system 20, based on the measurement information of the first position measurement system 30 and the measurement information of the second position measurement system 50. That is, the full-shot one point measurement is performed by moving slider 10 within the XY plane with respect to the mark detection system MDS using the step-and-repeat method.

Note that in case measurement unit 40 is equipped with the focal position detection system, controller 60 may perform adjustment of the Z position of the wafer surface based on detection results (output) of the focal position detection system, similar to the description in step S106.

On alignment measurement (full-shot one point measurement) to all wafer in step S112, while an offset load acts on surface plate 12 along with the movement, when slider 10 is moved within the XY plane, in the embodiment, controller 60 performs feedforward control individually on the three vibration isolators 14 according to the X, Y coordinate positions of the slider included in the measurement information of the first position measurement system 30 so that the influence of the offset load is canceled, and individually controls the drive force in the Z-axis direction that each vibration isolator 14 generates. Note that controller 60 may predict the offset load that acts on surface plate 12 and perform feedforward control individually on the three vibration isolators 14 so that the influence of the offset load is canceled, based on information on a known movement path of slider 10 without using the measurement information of the first position measurement system 30. Also, in the embodiment, since information on unevenness (hereinafter referred to as holder flatness information) of a wafer holding surface (a surface set by an upper end surface of multiple pins of a pin chuck) of wafer holder WH is obtained by experiment and the like in advance, on alignment measurement (e.g. full-shot one point measurement), when moving slider 10, controller 60, by performing feedforward control on the three vibration isolators 14 based on the holder flatness information to smoothly position an area including the wafer marks subject to measurement on the wafer W surface within a range of depth of focus of the optical system of mark detection system MDS, finely adjusts the Z position of surface plate 12. Note that one of feedforward control to cancel the offset load acting on surface plate 12 and feedforward control based on the holder flatness information described above or both of the controls do not have to be executed.

Note that in the case magnification can be adjusted in mark detection system MDS, the magnification may be set to low magnification on search alignment and to high magnification on alignment measurement. Also, in the case center position displacement and residual rotation error of wafer W loaded on slider 10 are small enough to be ignored, step S108 may be omitted.

In the full-shot one point measurement in step S112, actual values of position coordinates of a sample shot area (sample shot) in the reference coordinate system used in EGA calculation to be described later on are detected. Sample shot, of all the shots on wafer W, refers to a specific plurality of numbers of shots (at least three) determined in advance as shots used for EGA calculation to be described later on. Note that all shots on wafer W become sample shots in full-shot one point measurement. After step S112, the operation moves to step S124.

On the other hand, in the case the judgment in step S110 is negative, that is, in the case the mode set is B-mode, the operation moves to step S114 where judgment is made of whether or not count value i is smaller than a predetermined value K (K is a natural number that satisfies 1<K<I, and is a number decided in advance, e.g. 4). Note that count value i is incremented in step S128 to be described later on. And when judgment made in this step S114 is affirmative, the operation moves to step S120 where a full-shot multipoint measurement is performed on all shots. Full-shot multipoint measurement, here, means to measure each of a plurality of wafer marks for all shot areas on wafer W. The plurality of wafer marks that are to be measurement targets are decided in advance. For example, the measurement targets may be a plurality of wafer marks that are placed in an arrangement from which the shape of the shot (shape error from an ideal grating) can be obtained by statistical calculation. Since the procedure of measurement is similar to the case of full-shot one point measurement in step S112 except for the number of marks of the measurement target which is different, description in detail thereabout is omitted. After step S120, the operation moves to step S124.

On the other hand, in the case the judgment in step S114 is negative, the operation moves to step S116 where judgment is made of whether or not count value i is smaller than K+1. Here, since the judgment in step S116 is positive when count value i is both i≥K and i<k+1, accordingly, i=K.

In the case the judgment in step S116 is positive, the operation moves to step S118 where wafer marks that are to be measurement targets are decided for each shot, based on detection results of the wafer marks of wafer W to which measurement of K−1 wafers (e.g. in the case K=4, 3 wafers) has been performed so far. Specifically, the decision is made of whether detection of one wafer mark is enough, or a plurality of wafer marks should be detected for each shot. In the latter case, the wafer marks which should be subject to detection are also decided. For example, a difference (absolute value) between the actual measurement position and a design position of each of the plurality of wafer marks is to be obtained for each shot, and by judging whether or not a difference between the maximum value and the minimum value of the difference exceeds a certain threshold value, the decision is to be made of whether or not a plurality of wafer marks should be detected or detection of one wafer mark is enough for each shot. In the former case, for example, the wafer marks to be detected are decided so that the marks include a wafer mark having a maximum difference (absolute value) between the actual measurement position and the design position and a wafer mark having a minimum difference. After step S118, the operation moves to step S122.

On the other hand, in the case the judgment in step S116 is negative, the operation moves to step S122. Here, the judgment in step S116 is negative in the case count value i satisfies K+1≤i, and prior to this, count value always becomes i=K and wafer marks that are to be measurement targets are decided for each shot in step S118.

In step S122, the wafer marks that are decided to be measurement targets for each shot in step S118 are measured. Since the procedure of measurement is similar to the case of full-shot one point measurement in step S112 except for the number of marks of the measurement target which is different, description in detail thereabout is omitted. After step S122, the operation moves to step S124.

As is obvious from the description so far, in the case of B-mode, full-shot multipoint measurement is performed on wafers from the 1^(st) wafer within the lot to the K−1^(th) wafer (e.g. the 3^(rd) wafer), and from the K^(th) wafer (e.g. the 4^(th) wafer) to the I^(th) wafer (e.g. the 25^(th) wafer), measurement is to be performed on the wafer marks decided for each shots based on results of the full-shot multipoint measurement performed on the first K−1 wafers (e.g. three wafers).

In step 124, EGA operation is performed using position information of the wafer marks measured in any one of step S112, step S120, and step S122. EGA operation refers to a statistical calculation in which after measurement (EGA measurement) of the wafer marks described above, coefficients in a model formula expressing a relation between position coordinates of a shot and correction amounts of the position coordinates of the shot are obtained using statistical calculation such as a least squares method, based on the data of the difference between the design values and the actual measurement values of the position coordinates of the sample shot.

In the embodiment, as an example, the following model formula is used for calculating correction amounts from design values of a position coordinate of a shot.

$\begin{matrix} \left. \begin{matrix} {{dx} = {a_{0} + {a_{1} \cdot X} + {a_{2} \cdot Y} + {a_{3} \cdot X^{2}} + {a_{4} \cdot X \cdot Y} + {a_{5} \cdot Y^{2}} +}} \\ {{a_{6} \cdot X^{3}} + {{a_{7} \cdot X^{2}}Y} + {a_{8} \cdot X \cdot Y^{2}} + {{a_{9} \cdot Y^{3}}\mspace{14mu}\ldots}} \\ {{dy} = {b_{0} + {b_{1} \cdot X} + {b_{2} \cdot Y} + {b_{3} \cdot X^{2}} + {b_{4} \cdot X \cdot Y} + {b_{5} \cdot Y^{2}} +}} \\ {{b_{6} \cdot X^{3}} + {b_{7} \cdot X^{2} \cdot Y} + {b_{8} \cdot X \cdot Y^{2}} + {{b_{9} \cdot Y^{3}}\mspace{14mu}\ldots}} \end{matrix} \right\} & (1) \end{matrix}$

Here, dx and dy are correction amounts in the X-axis direction and the Y-axis direction from the design values of the position coordinates of the shot, and X and Y are design position coordinates of the shot in a wafer coordinate system using the center of wafer W as the origin. That is, formula (1) above is a polynomial expression related to design position coordinates X and Y for each shot in the wafer coordinate system using the center of the wafer as the origin, and is a model formula expressing a relation between position coordinates X and Y and correction amounts (alignment correction components) dx and dy of the position coordinates of the shot. Note that in the embodiment, since rotation between the reference coordinate system and the wafer coordinate system is canceled by the search alignment described earlier, in the following description, all the coordinate systems will be described as a reference coordinate system without distinguishing between the reference coordinate system and the wafer coordinate system in particular.

When using model formula (1), from position coordinates X and Y of a shot of wafer W, correction amounts of the position coordinates of the shot can be obtained. However, to calculate the correction amounts, coefficients a₀, a₁, . . . , b₀, b₁, . . . have to be obtained. After EGA measurement, based on the data of the difference between the design values and the actual measurement values of the position coordinates of the sample shot, coefficients a₀, a₁, . . . , b₀, b₁, . . . of the above formula (1) are obtained using statistical calculation such as a least squares method.

After coefficients a₀, a₁, . . . , b₀, b₁, . . . of model formula (1) have been decided, by obtaining correction amounts dx and dy of the position coordinates of each shot substituting design position coordinates X and Y of each shot (divided area) in the wafer coordinate system into model formula (1) whose coefficients are decided, true arrangement (including not only linear components but also nonlinear components as deformation components) of a plurality of shots (divided areas) on wafer W can be obtained.

Now, in the case of wafer W to which exposure has already been performed, the waveform of detection signals acquired as the measurement results is not always favorable for all wafer marks due to the influence of processing so far. When the positions of wafer marks having such defective measurement results (waveform of detection signals) are included in the above EGA operation, position error of the wafer marks having the defective measurement results (waveform of detection signals) will have adverse effects on the calculation results of coefficients a₀, a₁, . . . , b₀, b₁, . . . .

Therefore, in the embodiment, signal processor 49 only sends measurement results of the wafer marks that are favorable to controller 60, and controller 60 is to execute the EGA operation described above, using all the positions of the wafer marks whose measurement results have been received. Note that there is no limit in particular in the degree of the polynomial expression in the above formula (1). Controller 60 associates the results of EGA operation with identification information of the wafers (e.g. wafer number, lot number) along with information related to the marks used for the operation that are made into a file serving as alignment history data, and the file is stored in an internal or external memory device.

When EGA operation in step S124 is completed, then the operation moves to step S126 where wafer W is unloaded from slider 10. This unloading is performed under the control of controller 60, in a reversed procedure of the loading procedure in step S104 by wafer carrier system 70 and the vertical movement member on slider 10.

In the next step S128, after count value i of the counter has been incremented by 1 (i←i+1), the operation moves to step S130 where judgment is made of whether or not count value i is larger than the total number of wafers I in the lot. Then, when the judgment in this step S130 is negative, it is judged that processing to all the wafers in the lot is not yet complete, therefore the operation returns to step S104 and thereinafter repeats the processing (including judgment) from step S104 to step S130 until the judgment in step S130 turns positive.

Then, when the judgment in step S130 turns positive, it is judged that processing to all the wafers in the lot is complete, therefore this completes the series of processing in the present routine.

As is described in detail so far, with measurement device 100, on alignment measurement, for each of the I shots (e.g. 98 shots) on wafer W, position information is measured for at least one each of the wafer marks, and using this position information, coefficients a₀, a₁, . . . b₀, b₁, . . . of the above formula (1) are obtained using statistical calculation such as a least squares method. Accordingly, deformation components of the wafer grid can be accurately obtained not only for linear components but also for nonlinear components. Wafer grid, here, refers to a grid which is formed by connecting the center of shot areas on the wafer arranged according to a shot map (data related to arrangement of shot areas formed on the wafer). Obtaining correction amounts (alignment correction components) dx and dy of the position coordinates of the shot for a plurality of shots is no other than obtaining the deformation components of the wafer grid.

Exposure apparatus 200, as an example, is a projection exposure apparatus (scanner) of a step-and-scan method. FIG. 9 shows component parts inside the chamber of exposure apparatus 200, partly omitted.

Exposure apparatus 200, as is shown in FIG. 9, is equipped with an illumination system IOP, a reticle stage RST that holds reticle R, a projection unit PU that projects an image of a pattern formed on reticle R onto wafer W where a sensitive agent (resist) is coated, a wafer stage WST which moves within the XY plane holding wafer W, and a control system for these parts. Exposure apparatus 200 is equipped with a projection optical system PL that has an optical axis AX in the Z-axis direction parallel to optical axis AX1 of mark detection system MDS previously described.

Illumination system IOP includes a light source and an illumination optical system connected to the light source via a light-sending optical system, and illuminates a slit-shaped illumination area IAR narrowly extending in the X-axis direction (orthogonal direction of the page surface in FIG. 9) set (limited) on reticle R with a reticle blind (masking system) with an illumination light (exposure light) IL in an almost even illuminance. The structure of illumination system IOP is disclosed in, for example, U.S. Patent Application Publication No. 2003/0025890 and the like. Here, as illumination light IL, as an example, an ArF excimer laser beam (wavelength 193 nm) is used.

Reticle stage RST is arranged below illumination system IOP in FIG. 9. Reticle stage RST can be finely driven within a horizontal plane (XY plane) on a reticle stage surface plate (not shown) by a reticle stage drive system 211 (not shown in FIG. 9, refer to FIG. 10) including, e.g. a linear motor or the like, and can also be driven in a scanning direction (the Y-axis direction which is the lateral direction of the page surface in FIG. 9) in a range of predetermined strokes.

On reticle stage RST, a reticle R is mounted on which a pattern area and a plurality of marks whose positional relation with the pattern area is known are formed on a surface at the −Z side (pattern surface). Position information (including rotation information in the θz direction) within the XY plane of reticle stage RST is detected at all times by a reticle laser interferometer (hereinafter referred to as a “reticle interferometer”) 214 via a movable mirror 212 (or a reflection surface formed on an edge surface of reticle stage RST) at a resolution of around, e.g. 0.25 nm. Measurement information of reticle interferometer 214 is supplied to exposure controller 220 (refer to FIG. 10). Note that the position information within the XY plane of reticle stage RST described above may be measured by an encoder instead of reticle laser interferometer 214.

Projection unit PU is arranged below reticle stage RST in FIG. 9. Projection unit PU includes a barrel 240 and projection optical system PL held in barrel 240. Projection optical system PL, for example, is double telecentric and has a predetermined projection magnification (e.g. such as ¼ times, ⅕ times, or ⅛ times). Reticle R is placed so that its pattern surface almost coincides with a first surface (object plane) of projection optical system PL, and wafer W whose surface is coated with a resist (sensitive agent) is placed at a second surface (image plane) side of projection optical system PL. Therefore, when illumination light IL from illumination optical system IOP illuminates illumination area IAR on reticle R, illumination light IL that has passed reticle R forms a reduced image of a circuit pattern of reticle R (a reduced image of a part of the circuit pattern) in illumination area IAR on an area (hereafter also called an exposure area) IA on wafer W conjugate with illumination area IAR via projection optical system PL. And by relatively moving reticle R in the scanning direction (the Y-axis direction) with respect to illumination area IAR (illumination light IL) and also relatively moving wafer W in the scanning direction (the Y-axis direction) with respect to exposure area IA (illumination light IL) in accordance with synchronous drive of reticle stage RST and wafer stage WST, scanning exposure of a shot area (divided area) on wafer W is performed, and the pattern of reticle R is transferred onto the shot area.

As projection optical system PL, as an example, a refraction system is used, consisting only of a plurality of, e.g. around 10 to 20 refraction optical elements (lens elements) arranged along optical axis AX parallel to the Z-axis direction. Of the plurality of lens elements structuring this projection optical system PL, a plurality of lens elements on the object plane side (reticle R side) are movable lenses which are shifted and driven in the Z-axis direction (optical axis direction of projection optical system PL) by driving elements (not shown) such as piezo elements and are drivable in inclination directions (that is, the θx direction and the θy direction) with respect to the XY plane. Then, based on instructions from exposure controller 220, an image forming characteristic correction controller 248 (not shown in FIG. 9, refer to FIG. 10) independently adjusts applied voltage to each driving element, which allows each movable lens to be individually driven, and various image forming characteristics of projection optical system PL (such as magnification, distortion aberration, astigmatism, coma aberration, and curvature of field) are to be adjusted. Note that instead of, or in addition to moving the movable lenses, an air tight chamber can be provided between specific lens elements that are adjacent inside barrel 240, and image forming characteristic correction controller 248 may be made to control the pressure of gas inside the air tight chamber, or image forming characteristic correction controller 248 may have the structure of being able to shift a center wavelength of illumination light IL. These structures also allow adjustment of the image forming characteristics of projection optical system PL.

Wafer stage WST is driven in predetermined strokes in the X-axis direction and the Y-axis direction on a wafer stage surface plate 222 by a stage drive system 224 (shown in a block in FIG. 9 for the sake of convenience) including a planar motor, a linear motor or the like, and is also finely driven in the Z-axis direction, the θx direction, the θy direction, and the θz direction. On wafer stage WST, wafer W is held by vacuum chucking or the like via a wafer holder (not shown). In the embodiment, the wafer holder is to be able to hold by suction a 300 mm wafer. Note that instead of wafer stage WST, a stage device equipped with a first stage that moves in the X-axis direction, the Y-axis direction, and the θz direction and a second stage that finely moves on the first stage in the Z-axis direction, the θx direction and the θy direction may also be used. Note that one of, or both of wafer stage WST and the wafer holder of wafer stage WST may be called a “second substrate holding member”.

Position information within the XY plane of wafer stage WST (including rotation information (yawing quantity (rotation quantity θz in the θz direction), pitching quantity (rotation quantity θx in the θx direction), rolling quantity (rotation quantity θy in the θy direction))) is detected at all times by a laser interferometer system (hereinafter shortly referred to as interferometer system) 218 via a movable mirror 216 (or a reflection surface formed on an edge surface of wafer stage WST) at a resolution of, for example, around 0.25 nm. Note that position information within the XY plane of wafer stage WST may be measured by an encoder system instead of interferometer system 218.

Measurement information of interferometer system 218 is supplied to exposure controller 220 (refer to FIG. 10). Exposure controller 220, based on measurement information of interferometer system 218, controls position (including rotation in the θz direction) within the XY plane of wafer stage WST via stage drive system 224.

Also, although it is not illustrated in FIG. 9, position and inclination quantity in the Z-axis direction of the surface of wafer W are measured, for example, using a focus sensor AFS (refer to FIG. 10) consisting of a multi-point focal position detection system of an oblique incidence method disclosed in, for example, U.S. Pat. No. 5,448,332 and the like. Measurement information of this focus sensor AFS is also supplied to exposure controller 220 (refer to FIG. 10).

Also, on wafer stage WST, a reference plate FP having a surface which is the same height as that of the surface of wafer W is fixed. Formed on the surface of this reference plate FP are a first reference mark used for base line measurement or the like of an alignment detection system AS and a pair of second reference marks detected by a reticle alignment detection system to be described later on.

On the side surface of barrel 240 of projection unit PU, alignment detection system AS is provided that detects alignment marks formed on wafer W or the first reference marks. As alignment detection system AS, as an example, an FIA (Field Image Alignment) system is used which is a type of image forming alignment sensor using an image processing method to measure a mark position by illuminating the mark with a broadband (wide band) light such as a halogen lamp and image processing an image of the mark. Note that instead of or along with alignment detection system AS by the image processing method, a diffracted light interference type alignment system may also be used.

In exposure apparatus 200, further above reticle stage RST, a pair of reticle alignment detection systems 213 (not shown in FIG. 9, refer to FIG. 10) that can simultaneously detect a pair of reticle marks located at the same Y position on reticle R mounted on reticle stage RST are provided arranged a predetermined distance apart in the X-axis direction. Detection results of the marks by reticle alignment detection system 213 are supplied to exposure controller 220.

FIG. 10 shows an input/output relation of exposure controller 220 in a block diagram. As is shown in FIG. 10, other than the component parts described above, exposure apparatus 200 is equipped with parts such as a wafer carrier system 270 for carrying the wafer connected to exposure controller 220. Exposure controller 220 includes a microcomputer, a workstation and the like, and has overall control over the apparatus including the component parts described above. Wafer carrier system 270, for example, consists of a horizontal multi-joint arm robot.

Referring back to FIG. 1, although it is omitted in the drawings, C/D 300 is equipped with, for example, a coating section that performs coating of a sensitive agent (resist) with respect to a wafer, a developing section that can develop a wafer, a baking section that performs pre-bake (PB) and pre-develop bake (post-exposure bake: PEB), and a wafer carrier system (hereinafter referred to as a C/D inner carrier system for the sake of convenience). C/D 300 is furthermore equipped with a temperature controlling section 330 that can control the temperature of the wafer. Temperature controlling section 330 is normally a cooling section, and is equipped, for example, with a flat plate (temperature controlling device) called a cool plate. The cool plate is cooled, for example, by circulating cooling water. Other than this, thermoelectric cooling by the Peltier effect may be used in some cases.

Storage device 400 includes a control device connected to LAN 500 and a storage device connected to the control device via a communication channel such as Small Computer System Interface (SCSI).

With lithography system 1000 according to the embodiment, measurement device 100, exposure apparatus 200, and C/D 300 each have a bar code reader (not shown), and while the wafer is being carried by each of wafer carrier system 70 (refer to FIG. 7), wafer carrier system 270 (refer to FIG. 10), and the C/D inner carrier system (not shown), the bar code reader appropriately reads identification information of each wafer such as, e.g. wafer number, lot number and the like. Hereinafter, description related to the reading of identification information of each wafer using the bar code reader will be omitted to simplify the description.

In lithography system. 1000, exposure apparatus 200, C/D 300 and measurement device 100 (hereinafter also appropriately called three devices 100, 200, and 300) each perform processing on many wafers continuously. In lithography system 1000, the overall processing sequence is decided so that throughput of the system in total becomes maximum, that is, for example, processing time of other devices completely overlap the processing time of the device that requires the longest time for processing.

In the description below, a flow of operations performed in the case of processing many wafers continuously with lithography system 1000 will be described.

Firstly, the C/D inner carrier system (e.g. SCARA robot) takes out the first wafer (refer to as W₁) from a wafer carrier placed within the chamber of C/D 300 and delivers the wafer to the coating section. In accordance with the delivery, the coating section begins coating of resist. When the coating of resist is completed, the C/D inner carrier system takes out wafer W₁ from the coating section, and delivers the wafer to the baking section. In accordance with the delivery, the baking section begins heating processing (PB) of wafer W₁. Then, when PB of the wafer is completed, the C/D inner carrier system takes out wafer W₁ from the baking section, and delivers the wafer to temperature controlling section 330. In accordance with the delivery, cooling of wafer W₁ using the cool plate inside temperature controlling section 330 begins. This cooling is performed with the target temperature being a temperature which does not have any influence inside exposure apparatus 200, generally, for example, the target temperature of an air conditioning system of exposure apparatus 200 which is decided in a range of 20 to 25 degrees. Normally, at the point when the wafer is delivered to temperature controlling section 330, the temperature of the wafer is within a range of ±0.3[° C.], however, temperature controlling section 330 adjusts the temperature to a range of ±10[mK] to the target temperature.

Then, when the cooling (temperature control) inside the temperature controlling section 330 is completed, wafer W₁ is mounted on a first substrate delivery section provided in between C/D 300 and measurement device 100 by the C/D inner carrier system.

Inside C/D 300, a series of operations on wafers similar to the ones described above as in resist coating, PB, cooling, and carrying operation of the wafers described above that accompanies the series of operations are repeatedly performed, and the wafers are sequentially mounted on the first substrate delivery section. Note that practically by providing two or more each of the coating section and the C/D inner carrier system inside the chamber of C/D 300, parallel processing on a plurality of wafers becomes possible and the time required for pre-exposure processing can be shortened.

In measurement device 100, wafer W₁ before exposure sequentially mounted on the first substrate delivery section by the C/D inner carrier system is loaded on slider 10 in the procedure described earlier in the first embodiment by the cooperative work between wafer carrier system 70 and the vertical movement member on slider 10. After the loading, measurement device 100 performs alignment measurement of the wafer in the measurement mode set, and controller 60 obtains the correction amounts (coefficients a₀, a₁, . . . b₀, b₁, . . . of the above formula (1)) of the position coordinates of the shot on wafer W.

Controller 60 correlates historical information such as the correction amounts (coefficients a₀, a₁, . . . b₀, b₁, . . . of the above formula (1)) of the position coordinates obtained, information on the wafer marks whose position information of the marks are used to calculate the correction amounts, information on the measurement mode, and information on all wafer marks whose detection signals were favorable and identification information (wafer number, lot number) of wafer W₁ and makes an alignment history data (file), and stores the information in storage device 400.

Thereafter, wafer carrier system 70 mounts wafer W₁ that has finished alignment measurement on a loading side substrate mounting section of a second substrate delivery section provided near measurement device 100 inside the chamber of exposure apparatus 200. Here, in the second substrate delivery section, loading side substrate mounting section and an unloading side substrate mounting section are provided.

Hereinafter, in measurement device 100, to the second wafer and after in the same procedure as wafer W₁, alignment measurement, making of alignment history data (file), and wafer carriage are to be repeatedly performed.

Wafer W₁ mounted on the loading side substrate mounting section previously described is carried to a predetermined waiting position inside exposure apparatus 200 by wafer carrier system 270. However, the first wafer, wafer W₁ is immediately loaded onto wafer stage WST by exposure controller 220, without waiting at the waiting position. This loading of the wafer is performed in a similar manner as the loading performed by exposure controller 220 at the measurement device 100 previously described, using the vertical movement member (not shown) on wafer stage WST and wafer carrier system 270. After the loading, search alignment similar to the description earlier using alignment detection system AS and wafer alignment by the EGA method whose alignment shots are, e.g. 3 to 16 shots, are performed on the wafer on wafer stage WST. On this wafer alignment by the EGA method, exposure controller 220 of exposure apparatus 200 searches the alignment history data file stored in storage device 400, with the identification information of the wafer (target wafer) subject to wafer alignment and exposure serving as a key, and acquires the alignment history data of the target wafer. Then, after predetermined preparatory operations, exposure controller 220 performs the wafer alignment described later on, according to information on the measurement mode included in the alignment history data which has been acquired.

Here, prior to a concrete description of wafer alignment, the reason for exposure apparatus 200 performing wafer alignment by the EGA method in which to around 3 to 16 shots serve as alignment shots will be described.

The correction amounts (coefficients a₀, a₁, . . . b₀, b₁, . . . of the above formula (1)) of the position coordinates of the shot on wafer W obtained by measurement device 100 is used, for example, for positioning of the wafer with respect to the exposure position on exposure by exposure apparatus 200. However, wafer W whose correction amounts of the position coordinates have been measured by measurement device 100 according to exposure apparatus 200, is unloaded from slider 10 in the manner described earlier and then is loaded on the wafer stage for exposure. In this case even if the same type of wafer holders were used for wafer holder WH on slider 10 and the wafer holder on wafer stage WST of exposure apparatus 200, the holding state of wafer W differs between wafer holder WH on slider 10 and the wafer holder on the wafer stage of exposure apparatus 200 due to individual differences of the wafer holders. Therefore, even if the correction amounts (coefficients a₀, a₁, . . . b₀, b₁, . . . of the above formula (1)) of the position coordinates of the shot on wafer W were obtained by measurement device 100, the coefficients a₀, a₁, . . . b₀, b₁, . . . cannot all be used as they are. However, it is considered that the different holding state of wafer W for each wafer holder affects lower-degree components (linear components) not exceeding the first-degree of the correction amounts of the position coordinates of the shot, and hardly affects higher-degree components exceeding the second-degree. The reason for this is higher-degree components exceeding the second-degree are considered to be components that occur due to deformation of wafer W due to process, and it can be considered that the components are unrelated to the holding state of the wafer by the wafer holder.

Based on such consideration, coefficients a₃, a₄, . . . , a₉, . . . , and b₃, b₄, . . . b₉ of the higher-degree components which measurement device 100 takes time to obtain for wafer W can also be used without change as coefficients of higher-degree components of the correction amount of the position coordinates of wafer W in exposure apparatus 200. Accordingly, on the wafer stage of exposure apparatus 200, only a simple EGA measurement (e.g. measurement of around 3 to 16 wafer marks) is enough to obtain the linear components of the correction amount of the position coordinates of wafer W.

First of all, the case will be described when information of A-mode is included. In this case, a number of wafer marks corresponding to the number of alignment shots are selected as detection targets from the wafer marks whose position information are measured (marks whose position information are used for calculating the correction amount) by measurement device 100 included in the alignment history data, and the wafer marks serving as detection targets are detected using alignment detection system AS, and based on the detection results and the position (measurement information by interferometer system 218) of wafer stage WST at the time of detection, position information of each wafer mark that are detection targets are obtained, and using the position information, EGA operation is performed and each of the coefficients of the following formula (2) are obtained.

$\begin{matrix} \left. \begin{matrix} {{dx} = {c_{0} + {c_{1} \cdot X} + {c_{2} \cdot Y}}} \\ {{dy} = {d_{0} + {d_{1} \cdot X} + {d_{2} \cdot Y}}} \end{matrix} \right\} & (2) \end{matrix}$

Then, exposure controller 220 substitutes the coefficients (c₀, c₁, c₂, d₀, d₁, d₂) obtained here to coefficients (c₀, c₁, c₂, d₀, d₁, d₂) included in the alignment history data, obtains correction amounts (alignment correction components) dx and dy of the position coordinates of each shot using polynomial expressions related to the design position coordinates X and Y of each shot in a wafer coordinate system whose origin is the center of the wafer expressed by the following formula (3) which includes the coefficients after the substitution, and based on these correction amounts, decides a target position (hereinafter called a positioning target position for the sake of convenience) for positioning with respect to an exposure position (projection position of a reticle pattern) on exposure of each shot for correcting the wafer grid. Note that in the embodiment, while the exposure is performed by the scanning exposure method and not by the static exposure method, the term positioning target position is used for the sake of convenience.

$\begin{matrix} \left. \begin{matrix} {{dx} = {c_{0} + {c_{1} \cdot X} + {c_{2} \cdot Y} + {a_{3} \cdot X^{2}} + {a_{4} \cdot X \cdot Y} + {a_{5} \cdot Y^{2}} +}} \\ {{a_{6} \cdot X^{3}} + {{a_{7} \cdot X^{2}}Y} + {a_{8} \cdot X \cdot Y^{2}} + {{a_{9} \cdot Y^{3}}\mspace{14mu}\ldots}} \\ {{dy} = {d_{0} + {d_{1} \cdot X} + {d_{2} \cdot Y} + {b_{3} \cdot X^{2}} + {b_{4} \cdot X \cdot Y} + {b_{5} \cdot Y^{2}} +}} \\ {{b_{6} \cdot X^{3}} + {b_{7} \cdot X^{2} \cdot Y} + {b_{8} \cdot X \cdot Y^{2}} + {{b_{9} \cdot Y^{3}}\mspace{14mu}\ldots}} \end{matrix} \right\} & (3) \end{matrix}$

Note that also in exposure apparatus 200, since rotation between the reference coordinate system (stage coordinate system) that sets the movement of wafer stage WST and the wafer coordinate system is canceled due to search alignment, there is no need to distinguish between the reference coordinate system and the wafer coordinate system in particular.

Next, the case will be described when B-mode is set. In this case, exposure controller 220 decides a positioning target position for each shot for correcting the wafer grid according to a similar procedure as the above A-mode. However, in this case, in the alignment history data, of a plurality of wafer marks for some shots and one wafer mark each for the remaining shots, wafer marks whose detection signals were favorable are included as the wafer marks whose position information of the mark are used to calculate the correction amount.

Then, in addition to deciding the positioning target position of each shot described above, exposure controller 220, select a number of wafer marks necessary to obtain the shape of the shot from the above plurality of wafer marks for some shots, and using the position information (actual measurement value) of these wafer marks, perform statistical calculation (also referred to as in-shot multi-point EGA operation) applying a least squares method on a model formula [Mathematical 7] disclosed in, for example, U.S. Pat. No. 6,876,946, and obtains the shape of the shot. Specifically, of the 10 parameters in the model formula [Mathematical 7] disclosed in the above U.S. Pat. No. 6,876,946, chip rotation (θ), chip rectangular degree error (w), and chip scaling (rx) in the x-direction and chip scaling (ry) in the y-direction are obtained. As for the in-shot multi-point EGA operation, since the details are disclosed in detail in the above U.S. patent, the description thereabout will be omitted.

Then, exposure controller 220 performs exposure by the step-and-scan method on each shot on wafer W₁, while controlling the position of wafer stage WST according to the positioning target positions. Here, in the case the shape of the shot is also obtained by in-shot multi-point EGA measurement, during scanning exposure, at least one of a relative scanning angle between reticle stage RST and wafer stage WST, scanning speed ratio, relative position of at least one of reticle stage RST and wafer stage WST with respect to the projection optical system, image forming characteristic (aberration) of projection optical system PL, and wavelength of illumination light (exposure light) is adjusted so that the projection image of the pattern of reticle R by projection optical system PL changes in accordance with the shape of shot obtained. Here, adjustment of the image forming characteristic (aberration) of projection optical system PL and adjustment of the center wavelength of illumination light IL are performed by exposure controller 200 via image forming characteristic correction controller 248.

In parallel with EGA wafer alignment and exposure performed on the wafer (in this case, wafer W₁) on wafer stage WST, measurement device 100 executes wafer alignment measurement in the mode set, making of alignment history data and the like on a second wafer (referred to as wafer W₂) in the procedure previously described.

Then, before exposure is completed on the wafer (in this case, wafer W₁) on wafer stage WST, measurement processing of measurement device 100 is completed and the second wafer W₂ is mounted on the loading side substrate mounting section by wafer carrier system 70, carried to a predetermined waiting position inside exposure apparatus 200 by wafer carrier system 270, and then is to wait at the waiting position.

Then, when exposure of wafer W₁ is completed, wafer W₁ and wafer W₂ are exchanged on the wafer stage, and to wafer W₂ that has been exchanged, wafer alignment and exposure similar to the previous description is performed. Note that in the case carriage to the waiting position of wafer W₂ cannot be completed by the time exposure on the wafer (in this case, wafer W₁) on the wafer stage is completed, the wafer stage is to wait near the waiting position while holding the wafer which has been exposed.

In parallel with the wafer alignment to wafer W₂ that has been exchanged, wafer carrier system 270 carries wafer W₁ that has been exposed to the unloading side substrate mounting section of the second substrate delivery section.

Hereinafter, as is previously described, wafer carrier system 70, in parallel with the alignment measurement of the wafer by measurement device 100, is to repeatedly perform the operation of carrying and mounting the wafer that has been exposed from the unloading side substrate mounting section onto the first substrate delivery section, and the operation of taking out the wafer before exposure that has completed measurement from slider 10 and carrying the wafer to the loading side substrate mounting section in a predetermined degree.

The wafer that has been exposed carried and mounted on the first substrate delivery section by wafer carrier system 70 in the manner described earlier is carried into the baking section by the C/D inner carrier system where PEB is performed on the wafer by a baking apparatus in the baking section. The baking section can simultaneously house a plurality of wafers.

Meanwhile, the wafer that has completed PEB is taken out from the baking section by the C/D inner carrier system, and then carried into the developing section where development by a developing apparatus begins inside.

Then, when developing of the wafer is completed, the wafer is taken out from the developing section by the C/D inner carrier system and delivered to a predetermined housing shelf inside the wafer carrier. Hereinafter, in C/D 300, in the procedure similar to that of wafer W₁, the operations of PEB, development, and wafer carriage are to be repeatedly performed on the second wafer that has been exposed and the wafers thereafter.

As is described so far, with lithography system 1000 according to the embodiment, in parallel with the processing operations of the wafer by exposure apparatus 200 including the simple EGA measurement and exposure described earlier, measurement device 100 can perform alignment measurement of the wafer and can perform efficient processing without hardly reducing the throughput of the wafer processing. Moreover, measurement device 100 can perform full-shot EGA in which all shots serve as sample shots in parallel with the wafer alignment and exposure operation of exposure apparatus 200. Further, since the coefficients of the higher-degree components in the model formula obtained by the full-shot EGA can be used without any changes in exposure apparatus 200, by only performing alignment measurement in which several shots serve as alignment shots and obtaining the coefficients of the lower-degree components in the above model formula in exposure apparatus 200, and using these coefficients of the lower-degree components and the coefficients of the higher-degree components obtained by measurement device 100, not only coefficients (undetermined coefficients) of the lower-degree components of model formula (1) but also coefficients (undetermined coefficients) of the higher-degree components can be determined, and by using model formula (1) whose undetermined coefficients are determined (that is, the above formula (3)) and design values (X, Y) of the arrangement of the plurality of shots on the wafer, correction amount from the design position of each shot can be obtained, which makes it possible to acquire correction amount with good precision, similar to the case in which coefficients of the lower-degree and higher-degree components of model formula (1) were obtained in exposure apparatus 200.

Also, with measurement device 100 according to the embodiment, controller 60 acquires position information of slider 10 with respect to surface plate 12 and relative position information between mark detection system MDS and surface plate 12 using the first position measurement system 30 and the second position measurement system 50, as well as obtain position information on the plurality of marks formed on wafer W using mark detection system MDS, while controlling movement of slider 10 by drive system 20. Accordingly, with measurement device 100, position information on the plurality of marks formed on wafer W can be obtained with good accuracy.

Also, with measurement device 100 according to the embodiment, controller 60 acquires measurement information (relative position information between surface plate 12 and mark detection system MDS) from the second position measurement system 50 at all times, and controls the position in directions of six degrees of freedom of surface plate 12 real time via (the actuators of) the three vibration isolators 14 so that the positional relation between the detection center of mark detection system MDS and the measurement point of the first position measurement system detecting position information in directions of six degrees of freedom of slider 10 with respect to surface plate 12 is maintained in a desired relation at a nm level. Also, controller 60 acquires measurement information (position information of slider 10 with respect to surface plate 12) by the first position measurement system 30 and measurement information (relative position information between surface plate 12 and mark detection system MDS) by the second position measurement system 50 while controlling the drive of slider 10 by drive system 20, and obtains position information on the plurality of wafer marks based on detection signals at the time of detection of the marks formed on wafer W using mark detection system MDS, measurement information by the first position measurement system 30 obtained at the time of detection of the marks formed on wafer W using mark detection system MDS, and measurement information by the second position measurement system 50 obtained at the time of detection when detecting the marks formed on wafer W using mark detection system MDS. Accordingly, with measurement device 100, position information on the plurality of marks formed on wafer W can be obtained with good accuracy.

Note that, for example, in the case of performing position control of wafer W (wafer stage WST) on exposure (to be described later on) based on position information of the marks that has been measured without performing EGA operation using the position information that has been measured, the measurement information by the second position measurement system 50 described above, for example, does not have to be used to calculate the position information. However, in this case, offset should be applied to use the measurement information by the second position measurement system 50 obtained at the time of detection when detecting the marks formed on wafer W using mark detection system MDS, and information used for moving wafer W may be corrected such as, for example, a positioning target value of wafer W (wafer stage WST). Or, taking into consideration the above offset, movement of a reticle R (reticle stage RST) at the time of exposure which will be described later on may be controlled.

Also, with measurement device 100 according to the embodiment, the first position measurement system 30 that measures position information in directions of six degrees of freedom of slider 10 on which wafer W is mounted and held can continue to irradiate a measurement beam on grating RG1 from head section 32 in a range where slider 10 moves for detecting at least the wafer marks on wafer W with mark detection system MDS. Accordingly, the first position measurement system 30 can measure position information continuously in the whole range within the XY plane where slider 10 moves for mark detection. Accordingly, for example, in a making stage (including a startup stage of the device in a semiconductor manufacturing factory) of measurement device 100, by performing origin setting of an orthogonal coordinate system (reference coordinate system) set by the measurement axis of the first position measurement system. 30, namely the grating of grating RG1, it becomes possible to acquire absolute coordinate positions of slider 10 within the XY plane, which in turn makes it possible to obtain absolute positions within the XY plane of marks (not limited to search marks and wafer marks, and also includes other marks such as overlay measurement marks (registration marks)) on wafer W held on slider 10 that are obtained from position information of slider 10 measured by the first position measurement system 30 and detection results of mark detection system MDS. Note that “absolute position coordinate” in the description will refer to a position coordinate in the above reference coordinate system.

In lithography system. 1000 according to the embodiment, for example, in the case throughput of wafer processing of the whole lithography system 1000 is not to be decreased more than necessary, the wafer that has been developed may be loaded on slider 10 of measurement device 100 again in the procedure similar to the wafer after PB and before exposure previously described, and measurement of positional displacement of the overlay displacement measurement mark (e.g. a box-in-box mark) formed on the wafer may be performed. That is, since measurement device 100 can measure the absolute value of the marks on the wafer (on the reference coordinate system according to the first position measurement system 30), not only is the measurement device suitable for wafer alignment measurement but also as a measurement device for performing positional displacement measurement of the overlay displacement measurement marks which is a kind of relative position measurement.

Note that with lithography system 1000 according to the embodiment, in exposure apparatus 200, the case has been described where coefficients of the lower-degree components of the first degree or less of the above model formula are obtained, and the coefficients of the lower-degree components and coefficients of the higher-degree components of the second degree or more of the above model formula acquired by measurement device 100 are used. However, the embodiment is not limited to this, and for example, coefficients of the components of the second degree or less of the above model formula may be obtained from the detection results of the alignment marks in exposure apparatus 200, and the coefficients of the components of the second degree or less and coefficients of the higher-degree components of the third degree or more of the above model formula acquired by measurement device 100 may be used. Or, for example, coefficients of the components of the third degree or less of the above model formula may be obtained from the detection results of the alignment marks in exposure apparatus 200, and the coefficients of the components of the third degree or less and coefficients of the higher-degree components of the fourth degree or more of the above model formula acquired by measurement device 100 may be used. That is, coefficients of the components of the (N−1)^(th) degree (N is an integer of 2 or more) or less of the above model formula may be obtained from the detection results of the alignment marks in exposure apparatus 200, and the coefficients of the components of the (N−1)^(th) degree and coefficients of the higher-degree components of the N^(th) degree or more of the above model formula acquired by measurement device 100 may be used.

Note that in the embodiment above, while measurement device 100 obtained coefficients a₃, a₄, a₅ . . . and b₃, b₄, b₅ . . . of the higher-degree components of the 2^(nd) degree or more and also coefficients a₀, a₁, a₂ . . . and b₀, b₁, b₂ . . . of the lower-degree components of the first-degree or under of model formula (1) that expresses the relation between design position coordinates X, Y of each shot in the wafer coordinate system (coincides with the reference coordinate system) and correction amounts (alignment correction components) dx and dy of the position coordinates of the shot, coefficients of the lower-degree components may be obtained by exposure apparatus 200, therefore, measurement device 100 does not necessarily have to obtain the coefficients of the lower-degree components.

Note that in the embodiment above, in the case B-mode is set in measurement device 100 and position information of a plurality of wafer marks is measured for several shots on the wafer, in exposure apparatus 200, chip rotation (θ), chip rectangular degree error (w), and chip scaling (rx) in the x-direction and chip scaling (ry) in the y-direction are obtained by in-shot multi-point EGA operation, and then the shape of shots obtained. However, this is not limiting, and for example, even in the case A-mode is set in measurement device 100 and position information of one wafer mark each in all the shots on the wafer, it is possible for exposure apparatus 200 to assume deformation (shape deformation of shots) of shots. This will be described below.

Wafer grid on wafer W is deformed due to process, and the individual shots are also slightly deformed by the process, however, the deformation is considered to reflect the deformation of the wafer grid. Change component of the wafer grid can be separated into the following four independent change components, and each change component induces the change described below.

-   -   (1) Change component in the X-axis direction of dx Magnification         change in the X-axis direction     -   (2) Change component in the Y-axis direction of dx Rotation         around the Y axis     -   (3) Change component in the X-axis direction of dy Magnification         change in the Y-axis direction     -   (4) Change component in the Y-axis direction of dy Rotation         around the X-axis

Accordingly, in the embodiment, the change components (1) to (4) described above are calculated, and based on the change components, deformation of the shots on wafer W is estimated.

Now, there are roughly the following two ways to estimate the deformation of shots.

(A) Partially differentiate X and Y of formula (3) whose coefficients (undetermined coefficients) have been determined, and deform the shots according to the partially differentiated values.

(B) Approximate the wafer grid to a model formula of a first-degree, and deform the shots according to the coefficients of the model formula. Here, the wafer grid can be calculated by substituting design position coordinates X, Y of each shot into formula (3) whose coefficients (undetermined coefficients) have been determined and obtaining the correction amounts (correction amounts dx, dy of the position coordinates of the shots) from the design values of the arrangement of the plurality of shots on the wafer, that is, obtaining the change components of the wafer grid, and using the correction amounts and the design values of the position coordinates of the shots.

Hereinafter, the method of correcting the shape of the shots by the estimation method in (A) will be referred to as a higher-degree partial differentiation correction and the method of correcting the shape of the shots by the estimation method in (B) will be referred to as a first-degree approximation correction. Even if the wafer grid is same, deformation state of the shots will differ in the case of using the method in (A) and using the method in (B).

FIGS. 12A and 12B schematically show an example of a difference between the higher-degree partial differentiation correction and the first-degree approximation correction. Here, to simplify the description, Y component in the wafer grid shall be dy=b₆·X³. FIG. 12A shows three shots corrected by the higher-degree partial differentiation correction. When performing higher-degree partial differentiation correction, rotation around the Y-axis of each shot is to follow the partial differentiation of dy=b₆·X³, which is dy′=3b₆·X². In this case, of the three shots, deformation amount of the shot in the center becomes 0. Meanwhile, FIG. 12B shows three shots corrected by the first-degree approximation correction. In the first-degree approximation correction, alignment by the EGA method is performed again using the first-degree model formula, and correction of the shape of shots is performed using the first-degree coefficients related to rotation around the Y-axis in the model formula. When this first-degree approximation correction is performed, as is shown in FIG. 12B, of the three shots, the shot in the center also has a first-degree deformation, and as a whole, the deformation of each shot becomes uniform.

As is shown in FIGS. 12A and 12B, while deformation of the shots in the higher-degree partial differentiation correction occurs locally along the wafer grid in the shot periphery, in the first-degree approximation correction, the deformation becomes uniform in all shots of wafer W. For example, either of the methods may be specified in an exposure recipe, and the user may appropriately select the method from an exposure recipe according to specification requirements of the semiconductors to be manufactured. In the case such estimation of deformation of the shots is performed, as is previously described, exposure controller 220, during scanning exposure, adjusts at least one of a relative scanning angle between reticle stage RST and wafer stage WST, scanning speed ratio, relative position of at least one of reticle stage RST and wafer stage WST with respect to the projection optical system, image forming characteristic (aberration) of projection optical system PL, and wavelength of illumination light (exposure light) is adjusted so that the projection image of the pattern of reticle R by projection optical system PL changes in accordance with the shape of shot obtained (shape of shots obtained by the estimation of deformation of the shots).

Note that in the lithography system according to the embodiment, in the case measurement unit 40 of measurement device 100 is equipped with the multi-point focal position detection system previously described, measurement device 100 may perform flatness measurement (also called focus mapping) of wafer W along with the wafer alignment measurement. In this case, by using the results of the flatness measurement, focus-leveling control of wafer W at the time of exposure becomes possible without exposure apparatus 200 performing the flatness measurement.

Note that in the embodiment above, while the target was a 300 mm wafer, the embodiment is not limited to this, and the wafer may also be a 450 mm wafer that has a diameter of 450 mm. Since measurement device 100 can also perform wafer alignment separately from exposure apparatus 200, even if the wafer is a 450 mm wafer, for example, full-point EGA measurement becomes possible without causing a decrease in the throughput of exposure processing.

Note that although it is omitted in the drawings, in lithography system 1000, exposure apparatus 200 and C/D 300 may be connected in-line, and measurement device 100 may be placed on an opposite side to exposure apparatus 200 of C/D 300. In this case, measurement device 100 can be used for alignment measurement (hereinafter referred to as pre-measurement) similar to the previous description performed on, for example, wafers before resist coating. Or, measurement device 100 can be used for positional displacement measurement (overlay displacement measurement) of overlay displacement measurement marks described earlier to wafers that have completed development, or can also be used for pre-measurement and overlay displacement measurement.

Note that in the embodiment described above, to simplify the description, while either A-mode or B-mode was to be set as the measurement mode of measurement device 100, the embodiment is not limited to this, and modes may also be set such as a C-mode in which a first number of wafer marks being two or more are detected for all shots on all the wafers in a lot, and a mode in which for all wafers in the lot, a second number of wafer marks being two or more are detected for a part of the shots, e.g. shots decided in advance located in the peripheral section of the wafer, and as for the remaining shots, one wafer marks is detected for each shot (referred to as a D-mode). Furthermore, an E-mode may be provided in which according to the detection results of the wafer marks of the first predetermined number of wafers in the lot, any one of A-mode, C-mode, and D-mode is selected for the remaining wafers in the lot.

Also, as a measurement mode of measurement device 100, as for all wafers in the lot, one or more wafer marks may be measured for a part of the shots, e.g. the number of shots being 90% or 80%, or as for the shots located in the center of the wafer, one or more wafer marks may be measured for shots arranged spaced apart by one spacing.

Note that in measurement device 100 according to the embodiment above, while the case has been described where gratings RG1, RG2 a, and RG2 b each have periodic directions in the X-axis direction and the Y-axis direction, however, the embodiment is not limited to this, and the grating section (two-dimensional grating) that each of the first position measurement system 30 and the second position measurement system 50 are equipped with may have periodic directions which are in two directions that intersect each other within the XY plane.

Also, the structure of measurement device 100 described in the embodiment above is a mere example. For example, the measurement device may have any structure, as long as the device has a stage (slider 10) which is movable with respect to the base member (surface plate 12) and can measure the position information of a plurality of marks on the substrate (wafer) held on the stage. Accordingly, the measurement device, for example, does not necessarily have to be equipped with the first position measurement system 30 and the second position measurement system 50.

Also, it is a matter of course that the structure and arrangement of the detection points of head section 32 of the first position measurement system 30 described above in the embodiment is a mere example. For example, the position of the detection point of mark detection system MDS and the detection center of head section 32 does not have to coincide with each other in at least one of the X-axis direction and the Y-axis direction. Also, the arrangement of the head section and grating RG1 (grating section) of the first measurement system 30 may be reversed. That is, the head section may be provided at slider 10 and the grating section may be provided at surface plate 12. Also, the first position measurement system 30 does not necessarily have to be equipped with encoder system 33 and laser interferometer system 35, and the first position measurement system 30 may be structured only with the encoder system. The first position measurement system may be structured with an encoder system that irradiates a beam on grating RG1 of slider 10 from the head section, receives the return beam (diffraction beam) from the grating, and measures the position information in directions of six degrees of freedom of slider 10 with respect to surface plate 12. In this case, the structure of the head section does not matter in particular. For example, a pair of XZ heads that irradiates detection beams on two points the same distance apart in the X-axis direction with respect to a predetermined point on grating RG1 and a pair of YZ heads that irradiates detection beams on two points the same distance apart in the Y-axis direction with respect to the predetermined point may be provided, or a pair of three-dimensional heads that irradiates detection beams on two points distanced apart in the X-axis direction on grating RG1 and an XZ head or a YZ head that irradiates a detection beam on a point whose position in the Y-axis direction differs from the two points described above may be provided. The first position measurement system 30 does not necessarily have to be able to measure the position information in directions of six degrees of freedom of slider 10 with respect to surface plate 12, and for example, may be a system that can measure position information only in the X, the Y and the θz directions. Also, the first position measurement system may be placed in between surface plate 12 and slider 10.

Similarly, the structure of the second position measurement system 50 described in the embodiment above is a mere example. For example, head sections 52A and 52B may be fixed at the surface plate 12 side and scales 54A and 54B may be provided integral to mark detection system MDS. Also, while the example of the second measurement system 50 having the pair of head sections 52A and 52B was described, the embodiment is not limited to this, and the second measurement system 50 may only have one head section or have three or more head sections. In any case, it is preferable that the positional relation in directions of six degrees of freedom between surface plate 12 and mark detection system MDS can be measured by the second position measurement system 50. However, the second measurement system does not necessarily have to be able to measure the positional relation in all the directions of six degrees of freedom.

Note that in the embodiment above, the case has been described where drive system 20 for driving slider 10 with respect to surface plate 12 in a non-contact manner is structured, with slider 10 being supported by levitation on surface plate 12 by the plurality of air bearings 18, the system including the first driver 20A which moves slider 10 in the X-axis direction and the second driver 20B which moves slider 10 in the Y-axis direction integral with the first driver 20A. However, the embodiment is not limited to this, and as drive system 20, a drive system having a structure in which slider 10 is driven in directions of six degrees of freedom on surface plate 12 can be employed. Such a drive system, as an example, can be structured using a magnetic levitation type planar motor. In such a case, air bearings 18 will not be required. Note that measurement device 100 may be equipped with a drive system for moving surface plate 12, separately from vibration isolator 14.

Note that with lithography system 1000 in FIG. 1, while only one measurement device 100 was provided, a plurality of measurement devices, such as two, may be provided as in the following modified example.

FIG. 11 schematically shows a structure of lithography system 2000 according to a modified example. Lithography system 2000 is equipped with exposure apparatus 200, C/D 300, and two measurement devices 100 a and 100 b structured in a similar manner as measurement device 100 described earlier. Lithography system 2000 is installed in a clean room.

In lithography system 2000, two measurement devices 100 a and 100 b are arranged in parallel between exposure apparatus 200 and C/D 300.

Exposure apparatus 200, C/D 300, and measurement devices 100 a and 100 b that lithography system 2000 has are placed so that their chambers are adjacent to one another. Exposure controller 220 of exposure apparatus 200, coater/developer controller 320 of C/D 300, and controller 60 that measurement devices 100 a and 100 b each have are connected to one another via LAN500, and communicate with one another. Storage device 400 is also connected to LAN.

In lithography system 2000 according to the modified example, since an operation sequence similar to lithography system 1000 described earlier can be set, an effect equivalent to that of lithography system 1000 can be obtained.

Adding to this, in lithography system 2000, a sequence can be employed in which both measurement devices 100 a and 100 b are used in alignment measurement (hereinafter referred to as post-measurement) subject to the wafer after PB previously described, as well as in alignment measurement (pre-measurement) subject to the wafer before resist coating similar to the description earlier. In this case, since pre-measurement subject to a wafer is performed in parallel with the series of wafer processing previously described subject to a wafer different from the wafer undergoing pre-measurement, throughput of the whole system is hardly reduced. However, for the first wafer, the time for pre-measurement cannot be overlapped with the series of wafer processing.

By comparing the position actually measured in the pre-measurement and the position actually measured in the post-measurement for the same wafer mark on the same wafer, position measurement error of the wafer mark occurring due to resist coating can be obtained. Accordingly, by correcting the position of the same wafer mark actually measured on wafer alignment subject to the same wafer by exposure apparatus 200 only by the position measurement error obtained above of the wafer mark occurring due to the resist coating, EGA measurement with high precision canceling the measurement error of the position of the wafer mark occurring due to the resist coating becomes possible.

In this case, in both pre-measurement and post-measurement, since the measurement results of the position of the wafer mark are affected by the holding state of the wafer holder, it is preferable to employ the sequence in which pre-measurement and post-measurement are performed on the same wafer using the same measurement device 100 a or 100 b.

In lithography system 2000, instead of the pre-measurement described above, the overlay displacement measurement previously described may be performed on the wafer that has been developed. In this case, one predetermined measurement device of the measurement devices 100 a and 100 b may be used exclusively for post-measurement, and the other may be used exclusively for overlay displacement measurement. Or, a sequence may be employed in which post-measurement and overlay displacement measurement are performed by the same measurement device 100 a or 100 b for the same wafer. In the latter case, pre-measurement may also be performed by the same measurement device for the same wafer.

Although it is omitted in the drawings, in lithography system 2000, the one predetermined measurement device of the measurement devices 100 a and 100 b may be placed on an opposite side to exposure apparatus 200 with respect to C/D 300. In this case, measurement device 100 a is suitable for performing the overlay displacement measurement previously described on the wafer that has been developed, when considering the wafer carriage flow. Note that if the individual difference of the holding state of the holders between measurement devices 100 a and 100 b is hardly a problem, then measurement device 100 a may be used for pre-measurement instead of overlay displacement measurement, or may be used for both overlay displacement measurement and pre-measurement.

Other than this, in addition to exposure apparatus 200 and C/D 300, three or more devices of measurement device 100 may be provided, with all devices connected in-line, and of the three measurement devices 100, two may be used for pre-measurement and post-measurement, and the remaining one measurement device may be used exclusively for overlay displacement measurement. Of the former two measurement devices, one may be used exclusively for pre-measurement and the other exclusively for post-measurement.

Note that in the embodiment and the modified example described above (hereinafter shortened to as the embodiment above), the case has been described where signal processor 49 processes detection signals of mark detection system MDS equipped in measurement devices 100, 100 a, and 100 b and sends measurement results only of wafer marks whose waveform of detection signals obtained as the detection results of mark detection system MDS are favorable to controller 60, and controller 60 performs EGA operation using the measurement results of the wafer marks, and as a result, exposure controller 220 performs EGA operation using position information of apart of the position information of the wafer marks selected from a plurality of wafer marks whose waveforms of detection signals obtained as the detection results of mark detection system MDS are favorable. However, the embodiment and the modified example are not limited to this, and signal processor 49 may send to controller 60 measurement results of remaining wafer marks excluding the wafer marks whose waveforms of the detection signals obtained as the detection results of mark detection system MDS are defective. Also, judgment of whether the detection signals obtained as the detection results of mark detection system MDS are favorable or not may be performed by controller 60 instead of the signal processor, and also in this case, controller 60 performs the EGA operation described earlier using only the measurement results of the wafer marks whose detection signals are judged favorable or remaining wafer marks excluding the wafer marks whose detection signals are judged defective. Then, it is desirable that exposure controller 220 performs the EGA operation described earlier using the measurement results of the wafer marks partly selected from the measurement results of the wafer mark used in EGA operation by controller 60.

Note that in the embodiment above, while the example is described where measurement devices 100, 100 a, and 100 b are placed in between exposure apparatus 200 and C/D 300 instead of the in-line interface section, this is not limiting, and the measurement device (100, 100 a, 100 b) may be a part of the exposure apparatus. For example, the measurement device may be installed in the delivery section inside exposure apparatus 200 where the wafers before exposure are delivered. Also, in the case the measurement device (100, 100 a, 100 b) is installed inside the chamber of exposure apparatus 200 as a part of exposure apparatus 200, the measurement device may or may not have a chamber. Also, in the case the measurement device (100, 100 a, 100 b) is a part of the exposure apparatus, the measurement device may have a controller, or may not have a controller and can be controlled by the controller of the exposure apparatus. In any case, the measurement device is connected in-line with the exposure apparatus.

Note that in the embodiment above, while the case has been described where the substrate processing device is a C/D, the substrate processing device only has to be a device which is connected in-line with the exposure apparatus and the measurement device, and may be a coating apparatus (coater) that coats a sensitive agent (resist) on a substrate (wafer), a developing apparatus (developer) that develops the substrate (wafer) which has been exposed, or a coating apparatus (coater) and a developing apparatus (developer) which are each connected in-line with the exposure apparatus and the measurement device.

In the case the substrate processing device is a coating apparatus (coater), the measurement device can be used only for the post-measurement previously described, or for the pre-measurement and the post-measurement. In this case, the wafer after exposure is to be delivered to a developing apparatus which is not connected in-line with the exposure apparatus.

In the case the substrate processing device is a developing apparatus (developer), the measurement device can be used only for the post-measurement previously described, or for the post-measurement and the overlay displacement measurement. In this case, the wafer on which the resist is coated in advance at a different place is to be delivered to the exposure apparatus.

In the embodiment above, while the case has been described where the exposure apparatus is a scanning stepper, the case is not limiting, and the exposure apparatus may be a static type exposure apparatus such as a stepper or a reduction projection exposure apparatus of a step-and-stitch method that combines a shot area and a shot area together. The embodiment can furthermore be applied to a multi-stage type exposure apparatus that is equipped with a plurality of wafer stages, as is disclosed in, for example, U.S. Pat. No. 6,590,634, U.S. Pat. No. 5,969,441, U.S. Pat. No. 6,208,407 and the like. Also, the exposure apparatus is not limited to a dry type exposure apparatus previously described that performs exposure of wafer W directly without using liquid (water), and the exposure apparatus may be a liquid immersion type exposure apparatus that exposes a substrate via liquid as is disclosed in, for example, European Patent Application Publication No. 1420298, International Publication WO 2004/055803, International Publication WO 2004/057590, U.S. Patent Application Publication No. 2006/0231206, U.S. Patent Application Publication No. 2005/0280791, U.S. Pat. No. 6,952,253 and the like. Also, the exposure apparatus is not limited to an exposure apparatus used for manufacturing semiconductor devices, and may be, for example, an exposure apparatus for liquid crystals used for transferring a liquid crystal display device pattern onto a square glass plate.

Note that the disclosures of all publications, International Publications, U.S. Patent Application Publications, and U.S. Patents related to exposure apparatuses and the like referred to in the embodiment above are incorporated herein by reference as a part of the present specification.

Semiconductor devices are manufactured through exposing a sensitive object using a reticle (mask) on which a pattern is formed with an exposure apparatus that structures a lithography system according to the embodiment described above and through a lithography step in which the sensitive object that has been exposed is developed. In this case, highly integrated devices can be manufactured at high yield.

Note that other than the lithography step, the manufacturing process of semiconductor devices may include steps such as; a step for performing function/performance design of a device, a step for making a reticle (mask) based on this design step, a device assembly step (including a dicing process, a bonding process, and a package process), and an inspection step.

While the above-described embodiment of the present invention are the presently preferred embodiment thereof, those skilled in the art of lithography systems will readily recognize that numerous additions, modifications, and substitutions may be made to the above-described embodiment without departing from the spirit and scope thereof. It is intended that all such modifications, additions, and substitutions fall within the scope of the present invention, which is best defined by the claims appended below. 

What is claimed is:
 1. A substrate processing system in which a substrate having a plurality of divided areas formed along with at least a mark serves as a processing target, comprising: a measurement device that has a first stage which can hold the substrate, the device measuring position information of a plurality of the marks on the substrate held on the first stage; and an exposure apparatus that has a second stage on which the substrate having completed measurement of position information of the plurality of marks by the measurement device is mounted, the apparatus performing a measurement operation of measuring position information of part of the marks of the plurality of marks on the substrate mounted on the second stage and an exposure operation of exposing the plurality of divided areas with an energy beam, wherein the measurement device obtains a first information that includes a nonlinear deformation component of an arrangement of the plurality of divided areas on the substrate, using the position information of the plurality of marks that has been measured, and the exposure apparatus obtains a second information that includes a nonlinear deformation component of an arrangement of the plurality of divided areas on the substrate using the position information of part of the marks that has been measured, and controls position of the second stage on the exposure operation based on the first information obtained by the measurement device and the second information.
 2. The substrate processing system according to claim 1, wherein the measurement device obtains a relation between a correction amount with respect to a reference value of an arrangement of the plurality of divided areas on the substrate from a model formula consisting of a first polynomial expression of a predetermined degree using the position information of the plurality of marks, and the exposure apparatus obtains a relation between a correction amount with respect to a reference value of an arrangement of the plurality of divided areas on the substrate from a model formula consisting of a second polynomial expression of a predetermined degree using the position information of part of the marks.
 3. The substrate processing system according to claim 2, wherein the plurality of divided areas are formed on the substrate arranged in a matrix state, and the correction amount of the arrangement of the plurality of divided areas is obtained based on a model formula consisting of a polynomial expression related to X, Y where a relation between design position coordinates X, Y in each divided area and the correction amount of position coordinates of the divided area is expressed in a substrate coordinate system, the system being a two-dimensional orthogonal coordinate system whose axial directions are an X-axis direction being a row direction of the matrix and a Y-axis direction being a column direction of the matrix with a predetermined reference point on the substrate serving as an origin, and the measurement device obtains coefficients of higher-degree terms of the model formula by statistical calculation, and the exposure apparatus obtains coefficients of lower-degree terms of the model formula by statistical calculation, and obtains the correction amount of the arrangement of the plurality of divided areas on the substrate, based on a model formula after determination of coefficients where the coefficients of the lower-degree terms and the coefficients of the higher-degree terms have been substituted into the model formula.
 4. The substrate processing system according to claim 3, wherein the lower-degree term is a term having a degree of 1 or less, the higher-degree term is a term having a degree of two or more.
 5. The substrate processing system according to claim 3, wherein the exposure apparatus estimates deformation of the plurality of divided areas according to values obtained by partial differentiation of X and Y in the model formula after the determination of coefficients.
 6. The substrate processing system according to claim 5, wherein the exposure apparatus is a scanning exposure apparatus that relatively moves a mask stage on which a mask having a pattern formed is held synchronously with the second stage with respect to a projection optical system and transfers the pattern onto the divided areas on the substrate, and the exposure apparatus adjusts at least one of a relative scanning angle between the mask stage and the second stage, scanning speed ratio, relative position of at least one of the mask stage and the second stage with respect to the projection optical system, image forming characteristic of the projection optical system, and wavelength of exposure light during the scanning exposure so that a projection image of the pattern by the projection optical system is deformed in accordance with the estimated shape of the divided areas after deformation.
 7. The substrate processing system according to claim 3, wherein the exposure apparatus obtains a grid being the arrangement of the plurality of divided areas on the substrate based on the correction amount obtained and design values of the arrangement of the plurality of divided areas on the substrate, approximates the grid to a model formula of degree 1, and estimates deformation of the plurality of divided areas according to a coefficient of the model formula.
 8. The substrate processing system according to claim 7, wherein the exposure apparatus is a scanning exposure apparatus that relatively moves a mask stage on which a mask having a pattern formed is held synchronously with the second stage with respect to a projection optical system and transfers the pattern onto the divided areas on the substrate, and the exposure apparatus adjusts at least one of a relative scanning angle between the mask stage and the second stage, scanning speed ratio, relative position of at least one of the mask stage and the second stage with respect to the projection optical system, image forming characteristic of the projection optical system, and wavelength of exposure light during the scanning exposure so that a projection image of the pattern by the projection optical system is deformed in accordance with the estimated shape of the divided areas after deformation.
 9. The substrate processing system according to claim 1, wherein the measurement device comprises a mark detection system that detects the mark on the substrate held on the first stage and outputs a detection signal, a position measurement system that measures position information of the first stage with respect to the mark detection system, a drive system that moves the first stage, and a controller, while controlling movement by the drive system of the first stage based on the position information measured by the position measurement system, acquires the position information by the position measurement system and obtains position information of the plurality of marks on the substrate held on the first stage using the mark detection system.
 10. The substrate processing system according to claim 9, wherein the first stage is movable with respect to a base member, and, the position measurement system includes a first position measurement system that measures a first position information of the stage with respect to the base member and a second position measurement system that measures a second position information relative between the mark detection system and the base member.
 11. The substrate processing system according to claim 10, wherein the first position measurement system having one of a measurement surface that has a grating section and a head section that irradiates a beam on the measurement surface provided at the first stage measures the first position information of the first stage by irradiating a plurality of beams from the head section on the measurement surface and receiving return beams of each of the plurality of beams from the measurement surface.
 12. The substrate processing system according to claim 9, wherein the measurement device can set a plurality of measurement modes in which detection conditions by the mark detection system are mutually different.
 13. The substrate processing system according to claim 12, wherein the plurality of measurement modes include at least one mode of a first mode in which one mark each serves as a detection target for all divided areas on the substrate, a second mode in which two or more each of a first number of marks are detected for all divided areas on the substrate, and a third mode in which two or more each of a second number of marks are detected for part of the divided areas on the substrate and one mark each for remaining divided areas.
 14. The substrate processing system according to claim 12, wherein the plurality of measurement modes include a fourth mode in which one mode of the plurality of modes is set according to measurement results of a predetermined number of substrates including a first substrate which is processed first when continuously processing a plurality of number of substrates.
 15. The substrate processing system according to claim 12, wherein the plurality of measurement modes include a mode in which marks subject to detection by the mark detection system are decided for remaining substrates, based on measurement results of a predetermined number of substrates including a first substrate which is processed first when continuously processing a plurality of number of substrates.
 16. The substrate processing system according to claim 12, wherein when the measurement device measures position information of the plurality of marks for at least part of the divided areas of the plurality of divided areas on the substrate, the exposure apparatus measures position information of a plurality of marks selected from the plurality of marks on the substrate including the plurality of marks whose position information is measured by the measurement device for the at least part of the divided areas on the substrate when performing the measurement operation, performs calculation including the statistical calculation using the position information of the plurality of marks that has been measured, and estimates deformation of the divided areas.
 17. The substrate processing system according to claim 1, wherein the measurement device and the exposure apparatus are connected in-line.
 18. The substrate processing system according to claim 17, further comprising: a substrate processing device connected in-line with the measurement device and the exposure apparatus, the substrate processing device applying a predetermined processing to the substrate.
 19. The substrate processing system according to claim 18, wherein the substrate processing device is a coating apparatus that coats a sensitive agent on a substrate.
 20. The substrate processing system according to claim 19, wherein the measurement device is arranged between the exposure apparatus and the substrate processing device.
 21. The substrate processing system according to claim 19, wherein the measurement device is used in at least one of a pre-measurement in which measurement of position information of a plurality of marks on a substrate before the sensitive agent is coated is performed and a post-measurement in which measurement of position information of the plurality of marks on the substrate coated with the sensitive agent is performed.
 22. The substrate processing system according to claim 19, wherein the measurement device is provided in a plurality of numbers and a first measurement device and a second measurement device of the plurality of numbers of the measurement devices are arranged between the exposure apparatus and the substrate processing device, the first measurement device and the second measurement device are used in both of a pre-measurement in which measurement of position information of a plurality of marks on a substrate before the sensitive agent is coated is performed and a post- measurement in which measurement of position information of the plurality of marks on the substrate coated with the sensitive agent is performed, and the first measurement device and the second measurement device are used in the pre-measurement and the post-measurement to the same substrate when continuously processing a plurality of number of substrates.
 23. The substrate processing system according to claim 19, wherein the measurement device is provided in a plurality of numbers, and of the plurality of measurement devices a first measurement device is placed between the exposure apparatus and the substrate processing device, and of the plurality of measurement devices a second measurement device is placed at an opposite side of the exposure apparatus with the substrate processing device in between, the second measurement device is used in a pre-measurement in which measurement of position information of a plurality of marks on a substrate before the sensitive agent is coated is performed, and the first measurement device is used in a post-measurement in which measurement of position information of the plurality of marks on the substrate on which the sensitive agent has been coated is performed.
 24. The substrate processing system according to claim 18, wherein the substrate processing device is a coating developing apparatus that coats a sensitive agent on a substrate as well as develops the substrate after exposure.
 25. The substrate processing system according to claim 24, wherein the measurement device is arranged between the exposure apparatus and the substrate processing device.
 26. The substrate processing system according to claim 24, wherein the measurement device is used in at least one of a pre-measurement in which measurement of position information of a plurality of marks on a substrate before the sensitive agent is coated is performed and a post-measurement in which measurement of position information of the plurality of marks on the substrate coated with the sensitive agent is performed.
 27. The substrate processing system according to claim 24, wherein the measurement device is provided in a plurality of numbers and a first measurement device and a second measurement device of the plurality of numbers of the measurement devices are arranged between the exposure apparatus and the substrate processing device, the first measurement device and the second measurement device are used in both of a pre-measurement in which measurement of position information of a plurality of marks on a substrate before the sensitive agent is coated is performed and a post- measurement in which measurement of position information of the plurality of marks on the substrate coated with the sensitive agent is performed, and the first measurement device and the second measurement device are used in the pre-measurement and the post-measurement to the same substrate when continuously processing a plurality of number of substrates.
 28. The substrate processing system according to claim 24, wherein the measurement device is provided in a plurality of numbers, and of the plurality of measurement devices a first measurement device is placed between the exposure apparatus and the substrate processing device, and of the plurality of measurement devices a second measurement device is placed at an opposite side of the exposure apparatus with the substrate processing device in between, the second measurement device is used in a pre-measurement in which measurement of position information of a plurality of marks on a substrate before the sensitive agent is coated is performed, and the first measurement device is used in a post-measurement in which measurement of position information of the plurality of marks on the substrate on which the sensitive agent has been coated is performed.
 29. The substrate processing system according to claim 24 wherein the measurement device is used in both of a post-measurement in which measurement of position information of the plurality of marks on the substrate coated with the sensitive agent is performed and an overlay displacement measurement in which positional displacement amount of overlay displacement measurement marks on the substrate after development is measured.
 30. The substrate processing system according to claim 24 wherein the measurement device is provided in a plurality of numbers, and of the plurality of measurement devices a first measurement device is placed between the exposure apparatus and the substrate processing device, and of remaining measurement devices a second measurement device is placed at an opposite side of the exposure apparatus with the substrate processing device in between, the first measurement device is used in a post-measurement in which measurement of position information of the plurality of marks on the substrate on which the sensitive agent has been coated is performed, and the second measurement device is used in an overlay displacement measurement in which measurement of positional displacement amount of overlay displacement measurement marks on the substrate after development is performed.
 31. The substrate processing system according to claim 18, wherein the substrate processing device is a developing apparatus that develops the substrate after exposure.
 32. The substrate processing system according to claim 31, wherein the measurement device is arranged between the exposure apparatus and the substrate processing device.
 33. The substrate processing system according to claim 31 wherein the measurement device is used in both of a post-measurement in which measurement of position information of the plurality of marks on the substrate coated with the sensitive agent is performed and an overlay displacement measurement in which positional displacement amount of overlay displacement measurement marks on the substrate after development is measured.
 34. The substrate processing system according to claim 31 wherein the measurement device is provided in a plurality of numbers, and of the plurality of measurement devices a first measurement device is placed between the exposure apparatus and the substrate processing device, and of remaining measurement devices a second measurement device is placed at an opposite side of the exposure apparatus with the substrate processing device in between, the first measurement device is used in a post-measurement in which measurement of position information of the plurality of marks on the substrate on which the sensitive agent has been coated is performed, and second measurement device is used in an overlay displacement measurement in which measurement of positional displacement amount of overlay displacement measurement marks on the substrate after development is performed.
 35. The substrate processing system according to claim 18, further comprising: a temperature controlling device that performs temperature control on the substrate so that temperature of the substrate is in a predetermined range prior to being mounted on the stage of the measurement device.
 36. The substrate processing system according to claim 35, wherein the temperature controlling device is provided at a part of the substrate processing device.
 37. A device manufacturing method, including: exposing a substrate using the substrate processing system according to claim 1; and developing the substrate that has been exposed.
 38. A substrate processing method in which a substrate having a plurality of divided areas formed along with at least a mark serves as a processing target, comprising: measuring position information of a plurality the marks on the substrate held on a first stage; and performing a measurement operation of making the substrate that has completed measurement of position information of the plurality of marks by the measuring be held by a second stage, and measuring position information of part of the marks of the plurality of marks on the substrate held by the second stage, and an exposure operation of exposing the plurality of divided areas with an energy beam, wherein in the measuring, a first information concerning arrangement of the plurality of divided areas on the substrate including a nonlinear deformation component of the arrangement is obtained using the position information of the plurality of marks measured, and in performing the measurement operation and the exposure operation, a second information concerning arrangement of the plurality of divided areas on the substrate including a linear deformation component of the ‘arrangement using the position information of part of the marks measured, and based on the first information obtained by the measuring and the second information, controlling position of the second stage when performing the exposure operation.
 39. The substrate processing method according to claim 38, wherein the plurality of divided areas are formed on the substrate arranged in a matrix state, and correction amount with respect to a reference value of the arrangement of the plurality of divided areas is obtained based on a model formula consisting of a polynomial expression related to X, Y that expresses a relation between design position coordinates X, Y in each divided area in a substrate coordinate system which is a two-dimensional orthogonal coordinate system whose axial directions are an X-axis direction being a row direction of the matrix and a Y-axis direction being a column direction of the matrix with a predetermined reference point on the substrate serving as an origin and correction amount of position coordinates of the divided area, and in the measuring, coefficients of higher-degree terms of the model formula are obtained by statistical calculation, and in performing the measurement operation and the exposure operation, coefficients of lower-degree terms of the model formula is obtained by statistical calculation, and the correction amount of the arrangement of the plurality of divided areas on the substrate is obtained, based on a model formula after determination of coefficients where the coefficients of the lower-degree terms and the coefficients of the higher-degree terms have been substituted into the model formula.
 40. The substrate processing method according to claim 39, wherein the lower-degree term is a term having a degree of 1 or less, and the higher- degree term is a term having a degree of two or more.
 41. A measurement device used in a substrate processing system in which a substrate having a plurality of divided areas are formed along with at least a mark serves as a processing target, and which comprises an exposure apparatus that has a first stage where the substrate is placed and performs a measurement operation of measuring position information of some marks of a plurality of the marks on the substrate placed on the first stage and an exposure operation of exposing the plurality of divided areas with an energy beam, the measurement device comprising: a second stage capable of holding the substrate; a measurement system that measures position information of the plurality of marks on the substrate held by the second stage; a calculation section that obtains first information concerning an arrangement of the plurality of divided areas on the substrate including a nonlinear deformation component of the arrangement, using the position information of the plurality of marks that has been measured; and an output section that outputs the first information so that exposure apparatus obtains second information concerning an arrangement of the plurality of divided areas on the substrate including a linear deformation component of the arrangement using the position information of the some marks that has been measured, and controls a position of the second stage when performing in the exposure operation based on the first information and the second information.
 42. The measurement device according to claim 41, wherein the calculation section obtains a relation of a correction amount with respect to a reference value of the arrangement of the plurality of divided areas on the substrate from a model formula consisting of a first polynomial expression of a predetermined degree using the position information of the plurality of marks, and the exposure apparatus obtains a relation of a correction amount with respect to a reference value of the arrangement of the plurality of divided areas on the substrate from a model formula consisting of a second polynomial expression of a predetermined degree using the position information of the some marks.
 43. The measurement device according to claim42, wherein the plurality of divided areas are formed on the substrate arranged in a matrix state, and the correction amount of the arrangement of the plurality of divided areas is obtained based on a model formula consisting of a polynomial expression related to X, Y where a relation between design position coordinates X, Y in each divided area and the correction amount of position coordinates of the divided area is expressed in a substrate coordinate system, the substrate coordinate system being a two-dimensional orthogonal coordinate system whose axial directions are an X-axis direction being a row direction of the matrix and a Y-axis direction being a column direction of the matrix with a predetermined reference point on the substrate serving as an origin, and the calculation section obtains coefficients of higher-degree terms of the model formula by statistical calculation, and the exposure apparatus obtains coefficients of lower-degree terms of the model formula by statistical calculation, and obtains the correction amount of the arrangement of the plurality of divided areas on the substrate, based on a model formula after determination of coefficients where the coefficients of the lower-degree terms and the coefficients of the higher-degree terms have been substituted into the model formula.
 44. The measurement device according to claim 43, wherein the lower-degree term is a term having a degree of 1 or less, and the higher- degree term is a term having a degree of two or more.
 45. The measurement device according to claim 43, wherein the exposure apparatus estimates deformation of the plurality of divided areas according to values obtained by partial differentiation of X and Y in the model formula after the determination of coefficients.
 46. The measurement device according to claim 43, wherein the exposure apparatus obtains a grid being the arrangement of the plurality of divided areas on the substrate based on the correction amount obtained and design values of the arrangement of the plurality of divided areas on the substrate, approximates the grid to a model formula of degree 1, and estimates deformation of the plurality of divided areas according to a coefficient of the model formula.
 47. The measurement device according to claim 41, further comprising: a mark detection system that detects the mark on the substrate held on the first stage and outputs a detection signal; a position measurement system that measures position information of the first stage with respect to the mark detection system; a drive system that moves the first stage; and a controller, while controlling movement by the drive system of the first stage based on the position information measured by the position measurement system, acquires the position information by the position measurement system and obtains position information of the plurality of marks on the substrate held on the first stage using the mark detection system.
 48. The measurement device according to claim 47, wherein the first stage is movable with respect to a base member, and, the position measurement system includes a first position measurement system that measures a first position information of the stage with respect to the base member and a second position measurement system that measures a second position information relative between the mark detection system and the base member.
 49. The measurement device according to claim 48, wherein in the first position measurement system one of a measurement surface that has a grating section and a head section that irradiates the measurement surface with a beam is provided at the first stage, and the first position measurement system measures the first position information of the first stage by irradiating the measurement surface with a plurality of beams from the head section and receiving a return beam of each of the plurality of beams from the measurement surface.
 50. The measurement device according to claim 47, wherein a plurality of measurement modes in which detection conditions by the mark detection system are mutually different can be set.
 51. The measurement device according to claim 50, wherein the plurality of measurement modes include at least one mode of a first mode in which one mark each serves as a detection target for all divided areas on the substrate, a second mode in which two or more each of a first number of marks are detected for all divided areas on the substrate, and a third mode in which two or more each of a second number of marks are detected for part of the divided areas on the substrate and one mark each is detected for remaining divided areas.
 52. The measurement device according to claim 51, wherein the plurality of measurement modes include a fourth mode in which one mode of the plurality of modes is set according to measurement results of a predetermined number of substrates including a first substrate which is processed first when continuously processing a plurality of number of substrates.
 53. The measurement device according to claim 50, wherein the plurality of measurement modes include a mode in which, based on measurement results of a predetermined number of substrates including a first substrate which is processed first, marks subject to detection by the mark detection system are decided for remaining substrates, when a plurality of number of substrates are continuously processed.
 54. The measurement device according to claim 50, wherein when the measurement system measures position information of the plurality of marks for at least part of the divided areas of the plurality of divided areas on the substrate, the exposure apparatus measures position information of multiple marks selected from the plurality of marks on the substrate including the plurality of marks whose position information has been measured by the measurement system for the at least part of the divided areas on the substrate when performing the measurement operation, performs calculation including statistical calculation using the position information of the plurality of marks that has been measured, and estimates deformation of the divided areas.
 55. The measurement device according to claim 41, wherein the calculation section is connected in-line to the exposure apparatus. 